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Introduction . 

A  key  facto  r  invo  Ived  in  breas  t  cancer  de  velopment  and  progress!  on  is  the  estrogen 
receptor  alp  ha  (ER).  Genom  e-wide  computational  stud  ies  on  ER  have  identified  over  70,000 
putative  Estrogen  Response  Elem  ents  (EREs)  in  the  human  genome.  However,  a  genom  e-wide 
functional  study  using  ChIP-Chip,  has  indicated  th  at  less  than  1/10  of  all  putative  ER  binding 
sites  are  recognized  by  the  recep  tor  following  estrogen  stimulation  in  breast  cancer  cells.  These 
sites  are  predom  inantly  distant  from  promoters  but  lie  in  close  proxim  ity  to  ge  nes  active  ly 
transcribed  following  estrogen  (E2)  stimulation. 

Chromatin  condensation  state  directly  im  pacts  gene  expression  level.  In  accordance, 
specific  histone  modifications,  such  as  trim  ethylation  of  histone  H3  on  lysine  4  (H3K4m  e3),  are 
present  at  the  prom  oter  of  activ  ely  expressed  genes.  In  addition,  monomethylation  of  this  sam  e 
residue  (H  3K4mel)  appears  to  define  puta  five  enh  ancer  regions  .W  hether  epigene  tic 
modifications,  such  as  H3K4m  el  or  m  e3,  can  determine  functional  bi  nding  sites  for  ER  and 
therefore  restrict  its  activity  to  a  small  subset  of  its  putative  binding  sites  in  breast  cancer  cells 
remains  ill  defined. 

Body . 

Aiml)  Screen  multiple  epigenetic  modifications  for  their  specific  occurrence  on  functional 
estrogen  receptor  alpha  (ER)  prior  to  its  recruitment. 

This  was  accom  plished  and  identified  the  pr  esence  of  the  mono  and  dim  ethylation  of 
lysine  4  on  histone  3  (H  3K4mel/me2)  on  functional  ER  binding  sites  (Eupien  et  ah,  2008).  The 
dimethylation  of  lysine  9  on  histone  3  (H  3K9me2)  was  found  enrich  ed  on  non-functional 
regulatory  elem  ents  contain!  ng  the  estrogen  responsive  elem  ent  (ERE).  W  e  extended  our 
research  to  dem  onstrate  tha  t  this  assoc  iation  between  ER  binding  and  the  presence  of 
H3K4mel/me2  also  applied  to  other  transcription  factors,  namely  EoxAl  and  API  (Eupien  et  ah, 
2008).  Eurthermore,  we  demonstrated  that  this  epigenetic  signature  defines  the  cell  type-specific 
recruitment  pattern  of  transcription  factors  (Eupien  et  ah,  2008). 
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Aim2)  Test  the  requirement  for  the  epigenetic  modifications  identified  under  the  first 
objective. 

This  was  accomplished  by  over-expressing  the  lysine  demethylase  KDMl  (LSDl)  whieh 
lead  to  a  signifieant  reduetion  in  H3K4  m  ethylation  state.  In  doing  so,  we  r  evealed  the 
requirement  for  the  m  ethylation  of  H3K4  at  distant  regulatory  elem  ents  for  transeription  faetor 
reeruitment  (Lupien  et  ah,  2008). 

Aim3)  Does  ER  over-expression  alter  the  distribution  of  the  epigenetic  signature. 

We  have  demonstrated  that  ER  aetivation  leads  to  the  indu  ction  of  spe  eifie  ep  igenetic 
modifications,  such  as  dim  ethylation  of  arig  inine  17  on  histone  3  (H3R  17me2),  aeetylation  of 
lysine  18  on  histone  3  (H3K18ae)  and  of  lysine  12  on  histone  4  (H4K12ac),  on  a  limited  number 
of  ER  binding  site  aeross  the  genom  e  of  bre  ast  eancer  eells  (Eupien  et  ah,  2009).  More 
importantly,  only  ER  binding  sites  undergoing  epigenetie  ehanges  following  ER  binding 
assoeiate  with  regulation  of  gene  expression  (Eupien  et  ah,  2009).  This  suggest  that  antagonizing 
these  ER  sites  alone  should  be  suffieient  to  bloek  breast  eancer  growth 

We  also  published  a  review  artiel  e  presenting  all  of  thes  e  results  in  e  ontext  with  the  f  ield  of 
breast  and  prostate  eancer  (Eupien  and  Brown,  2009). 

Key  Research  Accomplishments . 

We  identified  the  epigenetie  signature  that  defines  funetional  and  non-funetional 
regulatory  elements 

We  have  dem  onstrated  f  or  the  f  irst  tim  e  th  e  eell-type  speeifie  distribution  of  this 
epigenetie  signature 

We  have  demonstrated  that  th  is  epigenetie  si  gnature  is  required  for  transeription  f  aetor 
recruitment  to  the  genome 

We  have  identified  a  speeifie  novel  therapeutie  target  for  breast  eaneer  as  rem  oval  of  the 
H3K4mel/me2  epigenetie  signature  can  block  transcription  factor  recruitment,  namely  ER. 

We  identified  the  genom  e  wide  distribution  of  the  dim  ethylation  of  arginine  17  on 
histone  3  (H3R17me2)  epigenetie  modifieation  following  ER  aetivation 
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We  identified  that  a  subset  of  E  R  binding  s  ites  assoeiate  with  eh  anges  in  epig  enetie 
modifieations  sueh  as  d  imethylation  of  arginine  17  on  histone  3,  aeetylation  of  lysine  18  on 
histone  3  and  of  lysine  12  on  histone  4  (H3R17me2,  H3K18ae  and  H4K12ae,  respeetively). 

We  revealed  that  only  ER  sites  driving  th  e  estrogenie  response  as  soeiate  with  these 
epigenetie  modifieations  following  aetivation. 

Reportable  Outcomes . 

Mono  and  dim  ethylation  of  lysine  4  on  hi  stone  3  (H3K4m  el  and  m  e2)  is  a  novel 
therapeutie  target  again  st  breas  t  eaneer  as  th  is  epigenetie  signature  defines  the  funetional 
regulatory  elements  driving  transeription  in  breast  eaneers  (Eupien  et  ah,  2008). 

ER  eoativation  and  asso  eiated  epigenetie  remodeling  (H3R17me2,  H3K18ae,  H4K12ae) 
seleetively  oeeurs  on  a  restrieted  subset  of  its  binding  sites  that  driv  e  the  estrogenie  response 
(Eupien  et  ah,  2009). 

Conclusion . 

Methylation  of  histone  H3  lysine  4  is  part  of  the  epigenetie  signature  that  defines  lineage- 
speeific  re  eruitment  site  s  in  chrom  atin  f  or  trans  eription  faetors  sueh  as  ER.  Eurtherm  ore,  the 
pioneer  faetor  EoxAl  translates  this  epigenetie  signature  into  ehanges  in  ehrom  atin  strueture 
thereby  establishing  lineage-speeifie  transeriptional  enhaneers  and  programs. 

Transeriptional  response  to  E2  in  breast  eanee  r  eells  is  dependent  on  the  subset  of  the 
ERa  eistrome  assoeiated  with  eoaetivation  and  epigenetie  remodeling. 
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Introduction . 

A  key  facto  r  invo  Ived  in  breas  t  cancer  de  velopment  and  progress!  on  is  the  estrogen 
receptor  alp  ha  (ER).  Genom  e-wide  computational  stud  ies  on  ER  have  identified  over  70,000 
putative  Estrogen  Response  Elem  ents  (EREs)  in  the  human  genome.  However,  a  genom  e-wide 
functional  study  using  ChIP-Chip,  has  indicated  th  at  less  than  1/10  of  all  putative  ER  binding 
sites  are  recognized  by  the  recep  tor  following  estrogen  stimulation  in  breast  cancer  cells.  These 
sites  are  predom  inantly  distant  from  promoters  but  lie  in  close  proxim  ity  to  ge  nes  active  ly 
transcribed  following  estrogen  (E2)  stimulation. 

Chromatin  condensation  state  directly  im  pacts  gene  expression  level.  In  accordance, 
specific  histone  modifications,  such  as  trim  ethylation  of  histone  H3  on  lysine  4  (H3K4m  e3),  are 
present  at  the  prom  oter  of  activ  ely  expressed  genes.  In  addition,  monomethylation  of  this  sam  e 
residue  (H  3K4me I)  appears  to  define  puta  five  enh  ancer  regions  .W  hether  epigenetic 
modifications,  such  as  H3K4m  el  or  m  e3,  can  determine  functional  bi  nding  sites  for  ER  and 
therefore  restrict  its  activity  to  a  small  subset  of  its  putative  binding  sites  in  breast  cancer  cells 
remains  ill  defined. 
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Body . 

Aiml)  Screen  multiple  epigenetic  modifications  for  their  specific  occurrence  on  functional 
estrogen  receptor  alpha  (ER)  prior  to  its  recruitment. 

This  was  accom  plished  and  identified  the  pr  esenee  of  the  mono  and  dim  ethylation  of 
lysine  4  on  histone  3  (H  3K4mel/me2)  on  funetional  ER  binding  sites  (Lupien  et  ah,  2008).  The 
dimethylation  of  lysine  9  on  histone  3  (H  3K9me2)  was  found  enrieh  ed  on  non-funetional 
regulatory  elem  ents  eontaini  ng  the  estrogen  responsive  elem  ent  (ERE).  W  e  extended  our 
researeh  to  dem  onstrate  tha  t  this  assoe  iation  between  ER  binding  and  the  presenee  of 
H3K4mel/me2  also  applied  to  other  transeription  faetors,  namely  EoxAl  and  API  (Eupien  et  ah, 
2008).  Eurthermore,  we  demonstrated  that  this  epigenetie  signature  defines  the  eell  type-speeifie 
reeruitment  pattern  of  transeription  faetors  (Eupien  et  ah,  2008). 

Aim2)  Test  the  requirement  for  the  epigenetic  modifications  identified  under  the  first 
objective. 

This  was  aeeomplished  by  over-expressing  the  lysine  demethylase  KDMl  (ESDI)  whieh 
lead  to  a  signifieant  reduetion  in  H3K4  m  ethylation  state.  In  doing  so,  we  r  evealed  the 
requirement  for  the  m  ethylation  of  H3K4  at  distant  regulatory  elem  ents  for  transeription  faetor 
reeruitment  (Eupien  et  ah,  2008). 

Aim3)  Does  ER  over-expression  alter  the  distribution  of  the  epigenetic  signature. 

We  have  demonstrated  that  ER  aetivation  leads  to  the  indu  etion  of  spe  eifie  ep  igenetie 
modifieations,  sueh  as  dim  ethylation  of  arig  inine  17  on  histone  3  (H3R  17me2),  aeetylation  of 
lysine  18  on  histone  3  (H3K18ae)  and  of  lysine  12  on  histone  4  (H4K12ae),  on  a  limited  number 
of  ER  binding  site  aeross  the  genom  e  of  bre  ast  eaneer  eells  (Eupien  et  ah,  2009).  More 
importantly,  only  ER  binding  sites  undergoing  epigenetie  ehanges  following  ER  binding 
assoeiate  with  regulation  of  gene  expression  (Eupien  et  ah,  2009).  This  suggest  that  antagonizing 
these  ER  sites  alone  should  be  suffieient  to  bloek  breast  eaneer  growth 

We  also  published  a  review  artiel  e  presenting  all  of  thes  e  results  in  e  ontext  with  the  f  ield  of 
breast  and  prostate  eaneer  (Eupien  and  Brown,  2009). 
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Key  Research  Accomplishments . 

We  identified  the  epigenetic  signature  that  defines  functional  and  non- functional 
regulatory  elements 

We  have  dem  onstrated  f  or  the  f  irst  tim  e  th  e  cell-type  specific  distribution  of  this 
epigenetic  signature 

We  have  demonstrated  that  th  is  epigenetic  si  gnature  is  required  for  transcription  f  actor 
recruitment  to  the  genome 

We  have  identified  a  specific  novel  therapeutic  target  for  breast  cancer  as  rem  oval  of  the 
H3K4mel/me2  epigenetic  signature  can  block  transcription  factor  recruitment,  namely  ER. 

We  identified  the  genom  e  wide  distribution  of  the  dim  ethylation  of  arginine  17  on 
histone  3  (H3R17me2)  epigenetic  modification  following  ER  activation. 

We  identified  that  a  subset  of  E  R  binding  s  ites  associate  with  eh  anges  in  epig  enetic 
modifications  such  as  d  imethylation  of  arginine  17  on  histone  3,  acetylation  of  lysine  18  on 
histone  3  and  of  lysine  12  on  histone  4  (H3R17me2,  H3K18ac  and  H4K12ac,  respectively). 

We  revealed  that  only  ER  sites  driving  th  e  estrogenic  response  as  sociate  with  these 
epigenetic  modifications  following  activation. 
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Reportable  Outcomes . 

Publications:  (Lupien  et  al.,  2008),  (Lupien  et  al.,  2009)  and  (Lupien  and  Brown,  2009) 
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Conclusion . 

Mono  and  dim  ethylation  of  lysine  4  on  hi  stone  3  (H3K4m  el  and  m  e2)  is  a  novel 
therapeutie  target  again  st  breas  t  cancer  as  this  epigenetic  signature  defines  the  functional 
regulatory  elements  driving  transcription  in  breast  cancers  (Lupien  et  ah,  2008). 

ER  coativation  and  asso  ciated  epigenetic  remodeling  (H3R17me2,  H3K18ac,  H4K12ac) 
selectively  occurs  on  a  restricted  subset  of  its  binding  sites  that  driv  e  the  estrogenic  response 
(Lupien  et  ah,  2009). 
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SUMMARY 

Complex  organisms  require  tissue-specific  transcrip¬ 
tional  programs,  yet  little  is  known  about  how  these 
are  established.  The  transcription  factor  FoxAl  is 
thought  to  contribute  to  gene  regulation  through 
its  ability  to  act  as  a  pioneer  factor  binding  to  nucleo- 
somal  DNA.  Through  genome-wide  positional  analy¬ 
ses,  we  demonstrate  that  FoxAl  cell  type-specific 
functions  rely  primarily  on  differential  recruitment  to 
chromatin  predominantly  at  distant  enhancers  rather 
than  proximal  promoters.  This  differential  recruitment 
leads  to  cell  type-specific  changes  in  chromatin  struc¬ 
ture  and  functional  collaboration  with  lineage-specific 
transcription  factors.  Despite  the  ability  of  FoxAl  to 
bind  nucleosomes,  its  differential  binding  to  chroma¬ 
tin  sites  is  dependent  on  the  distribution  of  histone  H3 
lysine  4  dimethylation.  Together,  our  results  suggest 
that  methylation  of  histone  H3  lysine  4  is  part  of  the 
epigenetic  signature  that  defines  lineage-specific 
FoxAl  recruitment  sites  in  chromatin.  FoxAl  trans¬ 
lates  this  epigenetic  signature  into  changes  in  chro¬ 
matin  structure  thereby  establishing  lineage-specific 
transcriptional  enhancers  and  programs. 

INTRODUCTION 

Over  the  course  of  development,  cells  transit  from  a  plurlpotent 
state  to  one  of  many  committed  cell  lineages.  During  this  pro¬ 
cess,  transcription  factor  networks  are  activated  in  order  to  es¬ 
tablish  cell  type-specific  transcriptional  programs  (Son  et  al., 
2005).  FoxAl  (Hepatocyte  Nuclear  Factor  3a),  a  member  of  the 
Forkhead  family  of  winged-helix  transcription  factors,  is  involved 
in  the  development  and  differentiation  of  several  organs  includ¬ 
ing  liver,  kidney,  pancreas,  lung,  prostate,  and  mammary  gland 
(Friedman  and  Kaestner,  2006;  Kouros-Mehr  et  al.,  2006;  Spear 


et  al.,  2006).  In  addition,  high  expression  of  FoxAl  is  commonly 
observed  in  tumors  arising  from  these  organs,  including  prostate 
and  estrogen  receptor  a  (ERa)-positive  breast  tumors  (Lacroix 
and  Leclercq,  2004;  Lin  et  al.,  2002;  Mirosevich  et  al.,  2006). 
Interestingly,  FoxAl  expression  is  a  positive  prognostic  factor 
among  patients  with  ERa-positive  breast  tumors  and  correlates 
with  sensitivity  to  endocrine  therapy  (Badve  et  al.,  2007).  Consis¬ 
tent  with  its  originally  reported  role  as  a  pioneer  factor  involved  in 
liver-specific  gene  expression  (Bossard  and  Zaret,  2000;  Cirillo 
et  al.,  1998;  Gualdi  et  al.,  1996),  FoxAl  acts  as  a  pioneer  factor 
in  the  recruitment  of  ERa  to  several  c/s-regulatory  elements  in 
the  genome  and  subsequent  transcriptional  induction  of  target 
genes  such  as  Cyclin  D1  [CCND1)  in  breast  cancer  cells  (Carroll 
et  al.,  2005;  Eeckhouteet  al.,  2006;  Laganiere  etal.,  2005).  This  is 
mediated  in  part  through  the  chromatin  remodeling  activity  of 
FoxAl  (Cirillo  et  al.,  2002;  Eeckhoute  et  al.,  2006),  reminiscent 
of  its  role  in  the  induction  of  liver-specific  gene  expression  (Fried¬ 
man  and  Kaestner,  2006).  FoxAl  also  interacts  with  the  andro¬ 
gen  receptor  (AR)  in  prostate  cancer  cells  where  it  is  thought  to 
impact  the  regulation  of  AR  target  genes  (Gao  et  al.,  2003). 
Hence,  FoxAl  appears  capable  of  regulating  distinct  transcrip¬ 
tional  programs  in  cells  of  different  lineages.  However,  the 
molecular  bases  for  the  differential  transcriptional  activities  of 
FoxAl  remain  to  be  established.  In  the  present  study,  we  have 
investigated  FoxAl  differential  transcriptional  activities  in  breast 
and  prostate  cancer  cells  and  their  functional  relation  with  the 
epigenome  of  these  cells. 

RESULTS 

Dual  Regulatory  Role  of  FoxAl  in  E2  Signaling 
Revealed  by  Genome-wide  ChIP-chip 

Estrogen  (E2)  stimulation  leads  to  the  establishment  of  specific 
transcriptional  programs  in  ERa-positive  breast  cancer  cells.  To 
address  how  FoxAl  participates  in  this  process  we  initially  per¬ 
formed  an  unbiased  genome-wide  chromatin  immunoprecipita- 
tion  study  using  tiling  microarrays  (ChIP-chip)  to  define  the  reper¬ 
toire  of  FoxAl  -binding  sites,  which  we  define  as  its  “cistrome,”  in 
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Figure  1.  Genome-wide  Identification  of  FoxAf -Binding  Sites  Reveals  Its  Global  Role  in  Control  of  E2  Signaling  in  Breast  Cancer  Cells 

(A)  Overlap  analysis  at  FDR1  %  showing  the  number  of  binding  sites  specific  to  FoxAt  or  ERa  or  shared  between  the  two  factors  in  MCF7  cells. 

(B)  Correlation  between  E2  upreguiated  (left  panel)  or  downregulated  (right  panel)  genes  and  binding  of  either  ERa  only  (ERa  unique),  EoxAt  oniy  (FoxAt  unique), 
both  factors  at  different  sites  (ERa+FoxAt),  or  both  factors  at  a  shared  site  (ERo/FoxAl  overiapping  sites)  within  20  kb  of  the  TSS  of  genes.  Foid  change  is 
presented  for  instances  where  significant  differences  are  observed  between  regulated  (t  test  p  value  <  10“^)  and  nonregulated  genes  (t  test  p  value  >  10“^). 

(C)  Correlation  between  ERa-  and  EoxAt -binding  sites  and  genes  coexpressed  with  EoxAt  in  primary  breast  tumors  (Wang  et  al.,  2005)  were  analyzed  as  in  (B). 
Eold  change  is  presented  for  instances  where  significant  differences  are  observed. 


the  MCF7  breast  cancer  cell  line.  A  total  of  12904  high-confidence 
FoxAl  recruitment  sites  were  identified  in  these  cells  (using  a  strin¬ 
gent  statistical  false  discovery  rate  [FDR]  of  1  %)  (Figures  SI  and 
S2  available  online).  In  comparison,  the  ERa  cistrome  in  MCF7 
cells  (Carroll  et  al.,  2006)  reanalyzed  using  the  MAT  algorithm 
(Johnson  et  al.,  2006)  and  updated  to  the  most  recent  human  ge¬ 
nome  sequence  (FIgl  8)  revealed  5782  high-confidence  sites  (FDR 
1  %)  (Figure  S3).  Interestingly,  the  genomic  distribution  of  FoxAl  - 
binding  sites  was  reminiscent  of  that  of  ERa  (Carroll  et  al.,  2005; 
Lin  et  al.,  2007).  Indeed,  the  majority  of  the  sites  (96.9%)  were 
found  distant  from  the  proximal  1  kilobase  (kb)  promoter  regions 
of  genes  (Figure  S4B).  Accordingly,  this  distribution  contrasted 
with  that  of  RNA  polymerase  II  (RNA  Polll)  (Carroll  et  al.,  2005), 
which  is  found  primarily  at  proximal  promoters  (Figure  S4C).  Com¬ 
paring  the  FoxAl  and  ERa  cistromes  revealed  a  highly  significant 
overlap  with  ^50%-60%  ERa-binding  sites  occurring  on  FoxAl 
occupied  sites  (Figures  1  A,  S5A,  and  S5B).  To  determine  the  func¬ 


tional  significance  of  this  co-binding,  we  subsequently  determined 
the  distribution  of  FoxAl  -  and  ERa-binding  sites  with  regards  to 
E2-regulated  genes  in  MCF7  cells  (Carroll  et  al.,  2006).  Flence, 
we  compared  the  fraction  of  E2-regulated  versus  -nonregulated 
genes  in  MCF7  cells  with  at  least  one  binding  site  specific  to 
ERa  or  FoxAl  or  shared  by  the  two  factors  (as  defined  in  Figure  S5) 
within  20  kb  of  their  transcription  start  site  (TSS).  Importantly, 
E2-upregulated  genes  were  significantly  enriched  compared  to 
nonregulated  genes  near  sites  of  overlapping  ERo/FoxAl  recruit¬ 
ment  (Figure  1 B).  Strikingly,  this  was  also  the  case  for  E2-downre- 
gulated  genes  (Figure  1 B).  These  results  demonstrate  that  genes 
having  enhancers  within  20  kb  of  the  TSS  that  bind  both  ERa 
and  FoxAl  together  compared  to  ERa  or  FoxAl  separately  are 
much  more  likely  to  be  regulated  in  response  to  E2  treatment 
in  breast  cancer  cells.  A  role  for  FoxAl  in  E2-downregulated 
genes  independently  of  its  association  with  ERa  was  also  re¬ 
vealed  through  the  enrichment  for  this  category  of  genes  near 
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sites  recruiting  FoxAl  only  (Figure  1 B).  In  fact,  FoxAl  silencing  in 
MCF7  cells  reduced  the  basal  expression  of  these  genes  to  levels 
equivalent  to  the  reduction  seen  after  E2  treatment  (Figures  S6A 
and  S6B).  This  is  most  likely  a  consequence  of  FoxAl 's  role  in  al¬ 
lowing  for  the  basal  activity  of  enhancers  for  those  genes  (Figures 
S6C  and  S6D).  These  data  indicate  that  FoxAf  controls  the  E2 
response  in  breast  cancer  cells  through  a  combination  of  mecha¬ 
nisms  consisting  of  maintaining  the  basal  expression  of  genes 
repressed  following  hormone  treatment  and  allowing  for  the  in¬ 
duction  of  E2-upregulated  genes  through  a  direct  collaboration 
with  ERa.  Interestingly,  genes  with  FoxAl -binding  sites  within 
20  kb  of  their  TSS  also  had  a  greater  chance  to  be  expressed 
together  with  FoxAl  and  ERa  in  primary  breast  tumors  pointing 
to  the  biological  relevance  of  the  FoxAf  cistrome  beyond  the 
MCF7  cell  line  (Figures  1C,  S7,  and  S8). 

FoxAl  Cell  Type-Specific  Activity  Depends 
on  Differential  Recruitment  to  Chromatin 

Having  shown  that  FoxAl  recruitment  to  the  chromatin  within  the 
MCF7  cell  line  was  correlated  with  the  regulation  of  the  transcrip¬ 
tional  program  specific  to  ERa-positive  breast  tumors,  we  inves¬ 
tigated  how  FoxAl  binding  to  the  ohromatin  relates  to  its  cell- 
specific  functions.  This  was  accomplished  by  comparing  the 
FoxAl  cistromes  originating  from  cell  types  of  different  lineages, 
namely  the  MCF7  breast  cancer  and  LNCaP  prostate  cancer  cell 


Figure  2.  Cell  Type-Specific  Recruitment  of 
FoxAl  Correlates  with  Differential  Gene 
Expression  Patterns 

(A)  c/s-regulatory  element  annotation  system 
(CEAS)  (Ji  et  al.,  2006)  was  used  to  determine 
the  distribution  of  FoxATbinding  regions  identi¬ 
fied  within  chromosomes  8,  11,  and  12  in  MCF7 
and  LNCaP  ceiis  regarding  known  genes. 

(B)  Overiap  anaiysis  at  FDR1  %  showing  the  num¬ 
ber  of  FoxATbinding  sites  specific  to  MCF7  or 
LNCaP  or  shared  between  the  two  ceil  iines. 

(C)  Correiation  between  ceii  type-specific  or 
shared  FoxATbinding  sites  and  genes  coex¬ 
pressed  with  FoxAl  in  primary  breast  (Wang 
et  al.,  2005)  or  prostate  (S.R.  Setlur,  K.D.  Mertz, 
Y.  Hoshida,  F.  Demicheiis,  M.L.,  S.  Perner,  A. 
Sboner,  Y.  Pawitan,  O.  Andren,  L.A.  Johnson, 
et  al.  unpublished  data)  tumors.  The  occurrence 
of  FoxATbinding  sites  within  20  kb  of  the  TSS  of 
FoxAl  coexpressed  genes  was  compared  to  that 
of  non-coexpressed  genes.  Fold  change  is  pre¬ 
sented  for  instances  where  significant  differences 
are  observed. 


lines.  Through  genomic-scale  studies 
performed  across  the  nonrepetitive  re¬ 
gions  of  human  chromosomes  8,  11, 
and  12  using  ChIP-chip  assays,  we  iden¬ 
tified  over  2000  high-confidence  sites  of 
FoxAl  reoruitment  (FDR  1  %)  in  both  cell 
lines.  As  in  MCF7  cells,  these  sites  were 
predominantly  found  at  enhancer  posi¬ 
tions  in  LNCaP  cells  (Figures  2A  and 
S9).  Importantly,  comparison  of  the 
FoxAl  partial  cistromes  in  these  two  cell  lines  revealed  both 
a  significant  number  of  shared  sites  and  an  even  greater  number 
of  cell  type-specific  regions  (Figure  2B).  Indeed,  oomparisons  of 
the  datasets  using  various  cut-offs  indicated  that  the  overlap  did 
not  exceed  55%  and  40%  of  the  MCF7-  and  LNCaP-binding 
sites,  respectively  (Figures  S10A-S10C).  Therefore,  of  all  sites 
identified  in  both  cell  lines  (3932  sites  total),  over  65%  of  them 
correspond  to  regions  of  cell  type-specific  recruitment  (886  sites 
specific  to  MCF7  cells  and  1 654  sites  specific  to  LNCaP  cells). 
The  accuracy  of  these  predictions  was  validated  by  ChlP- 
qPCR  experiments  (Figure  S10D).  Hence,  on  a  genomic  scale 
the  majority  of  FoxAl  recruitment  sites  within  the  chromatin  of 
two  distinct  cellular  lineages  are  cell  type  specific.  These  results 
strongly  suggested  that  FoxAl  might  regulate  differential  tran¬ 
scriptional  programs  as  a  result  of  its  cell  type-specific 
recruitment  pattern  in  MCF7  and  LNCaP  cells. 

We  next  investigated  the  association  of  FoxAl -binding  sites 
unique  to  MCF7  or  LNCaP,  or  sites  shared  between  the  two  cell 
lines,  with  genes  coexpressed  with  FoxAl  in  primary  breast  or 
prostate  tumors.  This  revealed  a  significant  enrichment  of  genes 
coexpressed  with  FoxAl  in  primary  breast  tumors  over  non- 
coexpressed  genes  near  FoxAl  -specific  binding  sites  unique  to 
MCF7  breast  cancer  cells  (Figures  2C  and  S11)  (van  de  Vijver 
et  al.,  2002;  Wang  et  al.,  2005).  Reciprocally,  genes  coexpressed 
with  FoxA  1  in  primary  prostate  tumors  were  significantly  enriched 
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over  non-coexpressed  genes  near  FoxA1  -binding  sites  unique  to 
LNCaP  prostate  cancer  celis  (Figure  2C)  (S.R.  Setiur,  K.D.  Mertz, 
Y.  Ftoshida,  F.  Demicheiis,  M.L.,  S.  Perner,  A.  Sboner,  Y.  Pawitan, 
O.  Andren,  L.A.  Johnson,  et  ai.,  unpubiished  data).  Aitogether, 
these  resuits  demonstrate  that  differentiai  recruitment  is  the 
primary  mechanism  responsibie  for  the  differentiai  function  of 
FoxAf  in  these  two  different  ceii  iineages. 

FoxAl  Alternatively  Collaborates  with  ERa  or  AR 
at  Cell-Specific  Enhancers 

In  order  to  further  characterize  the  functionai  mechanisms 
invoived  in  FoxAl  reguiation  of  the  breast  and  prostate  cancer- 
specific  transcriptionai  programs,  we  monitored  the  transcrip¬ 
tion  factor  binding  motifs  enriched  within  the  common  FoxAf 
recruitment  sites,  as  weii  as  those  unique  to  each  ceii  line.  As  ex¬ 
pected,  the  Forkhead  motif  (FKHR)  was  enriched  in  aii  three  sub¬ 
sets  of  FoxAl  -binding  regions  (Figure  3A).  Converseiy,  we  found 
that  the  recognition  motifs  for  the  nuciear  receptors  ERa  (ERE 
and  ERE  haif-site)  and  AR  (ARE  and  ARE  haif-site)  were  specifi- 
caily  enriched  in  FoxAl -binding  sites  unique  to  MCF7  or  to 
LNCaP  ceiis,  respectiveiy  (Figure  3A).  This  suggested  that  the 
differentiai  FoxAl  recruitment  between  MCF7  and  LNCaP  was 
correlated  with  ceii-specific  transcriptionai  coiiaborations  with 
ERa  or  AR.  This  hypothesis  was  tested  by  comparing  the 
FoxAl  cistrome  on  chromosomes  8,  11,  and  12  from  both  ceii 
iines  to  that  of  AR  in  LNCaP  celis  (Q.W.  and  M.B.,  unpubiished 
data)  and  to  that  of  ERa  in  MCF7  ceiis  (Carroli  et  ai.,  2006).  Inter- 
estingiy,  as  was  the  case  for  ERa,  we  found  that  more  than  haif  of 
AR-binding  sites  in  LNCaP  celis  occurred  on  sites  where  FoxAf 
was  aiso  present  (Figure  3B).  These  data  strongiy  suggest  that 
the  functional  relationship  between  FoxAl  and  AR  previousiy 
demonstrated  at  a  few  model  genes  (Gao  et  ai.,  2003)  in  fact 
extends  to  a  iarge  fraction  of  regions  used  by  this  nuciear  recep¬ 
tor.  Accordingiy,  FoxAl  siiencing  moduiated  the  transcriptionai 
response  to  dihydroxytestosterone  (DFIT)  of  severai  studied 
target  genes  (Figure  SI  2).  Importantly,  the  majority  of  FoxAl - 
binding  sites  overiapping  with  ERa  were  sites  specific  to  MCF7 
ceils,  while  the  majority  of  FoxAl  -binding  sites  overlapping  with 
AR  were  sites  specific  to  LNCaP  cells  (Figure  3B).  These  data 
suggest  that  the  cell  type-specific  recruitment  of  FoxAl  to  the 
chromatin  is  linked  to  breast  and  prostate  cancer  transcriptional 
programs  through  specific  collaborations  with  ERa  in  breast  cells 
and  AR  in  prostate  cells.  Indeed,  these  nuclear  receptors  are 
known  to  be  master  regulators  of  the  behavior  of  a  large  subset 
of  breast  and  prostate  tumors  through  transmission  of  estrogenic 
and  androgenic  signals.  Hence,  we  investigated  the  association 
of  the  different  classes  of  sites  with  genes  regulated  by  E2  in 
MCF7  cells  or  those  regulated  by  DHT  in  LNCaP  cells  (Carroll 
et  al.,  2006;  Wang  et  ai.,  2007).  Only  genes  regulated  by  E2 
were  significantly  enriched  over  nonregulated  genes  near  ERa 
sites  overlapping  with  FoxAl  in  MCF7  cells  (Figure  3C).  In  con¬ 
trast,  genes  regulated  by  DHT  were  specifically  significantly  en¬ 
riched  over  nonregulated  genes  near  AR  sites  overlapping  with 
FoxAl  in  LNCaP  cells  (Figure  3C).  Importantly,  E2  or  DHT  regu¬ 
lated  genes  were  mostly  associated  with  the  cell  type-specific 
FoxAl -binding  sites  overlapping  with  ERa  or  AR  and  not  those 
common  to  both  cell  lines  (100%  for  AR/FoxAl  sites  and  70% 
for  ERa/FoxAl  sites).  Overall,  these  data  clearly  implicate  a  role 


for  FoxAl  in  the  regulation  of  breast-  and  prostate-specific  tran¬ 
scriptional  programs  through  cell-specific  recruitment  and  sub¬ 
sequent  differential  collaboration  with  the  sex  steroid  nuclear  re¬ 
ceptors  ERa  and  AR. 

Differential  recruitment  to  the  chromatin  extends  to  other  tran¬ 
scription  factors  present  in  both  MCF7  and  LNCaP  cells.  Indeed, 
AP-1,  whose  recognition  motif  was  enriched  within  the  FoxAl - 
binding  sites  from  MCF7  and  LNCaP  cells  (Figure  S13A),  was 
found  to  be  corecruited  together  with  FoxAl  at  a  subset  of  its 
cell-specific  binding  sites  (Figure  S13B).  Hence,  these  data 
demonstrate  that  cell-specific  recruitment  also  extends  to  ubiq¬ 
uitously  expressed  transcription  factors  such  as  AP-1  and  sug¬ 
gest  that  this  differential  recruitment  could  also  play  an  important 
role  in  its  well-known  cell-lineage  differential  activities  (Jochum 
et  al.,  2001). 

A  Cell  Type-Specific  Histone  Signature  Correlates 
with  Differential  FoxAf  Recruitment 

The  functional  importance  of  FoxAl  cell-specific  recruitment 
described  above  raises  the  question  as  to  how  FoxAl  is  able 
to  bind  to  distinct  regions  within  the  genome  of  the  MCF7  and 
LNCaP  cells.  Accordingly,  we  first  considered  the  possibility 
that  the  sequence  recognized  by  FoxAl  could  be  different 
between  the  two  cell  lines.  However,  de  novo  motif  analysis 
revealed  that  the  Forkhead  factor  recognition  sequence  en¬ 
riched  within  the  FoxAl -binding  sites  did  not  show  any  signifi¬ 
cant  difference  between  shared  and  cell-specific  binding  regions 
though  it  varied  somewhat  from  the  previously  established  con¬ 
sensus  motif  (Figure  4A).  Therefore,  we  investigated  whether  the 
differential  FoxAl  binding  could  rather  be  linked  to  specific  epi¬ 
genetic  modifications.  First,  we  looked  at  several  repressive  his¬ 
tone  marks  (Bernstein  et  al.,  2007;  Kouzarides,  2007)  and  found 
that  H3K9me2  was  more  highly  enriched  on  sites  not  recruiting 
FoxAl  in  both  cell  lines  although  not  exclusively  found  on  sites 
not  recruiting  FoxAl  (Figures  4B,  4C,  and  SI  4A).  We  then  sought 
to  determine  if  FoxAl  recruitment  was  on  the  other  hand  associ¬ 
ated  with  the  presence  of  active  histone  marks.  Recently,  a  geno¬ 
mic-scale  study  demonstrated  the  occurrence  of  mono-  (mel) 
and  dimethylation  (me2)  of  H3K4  at  active  enhancers  (Heintzman 
et  al.,  2007).  Analyzing  the  presence  of  these  specific  histone 
modifications  at  the  FoxAl  recruitment  sites  revealed  significant 
enrichment  for  H3K4me1  and  me2  in  a  cell  type-specific  manner 
(Figures  4D-4G).  Indeed,  in  MCF7  cells,  FoxAl -binding  sites 
unique  to  MCF7  cells  as  well  as  sites  common  to  both  cell  lines 
were  significantly  mono-  and  dimethylated  on  H3K4  compared 
to  the  LNCaP  unique  FoxAl -binding  sites  (Figures  4D  and  4F). 
On  the  other  hand,  in  LNCaP  cells,  the  LNCaP-specific  FoxAl - 
binding  sites  together  with  the  common  sites  were  significantly 
enriched  for  these  histone  modifications  compared  to  MCF7- 
specific  sites  (Figures  4E-4G).  To  confirm  this  correlation  be¬ 
tween  H3K4  methylation  and  FoxAl  occupancy  on  a  genomic 
scale  we  performed  a  ChIP-chip  analysis  of  H3K4me2  levels 
in  MCF7  cells  across  chromosomes  8,  11,  and  12.  These  data 
revealed  that  on  a  genomic  scale  levels  of  H3K4me2  in  MCF7 
cells  were  indeed  significantly  greater  on  MCF7-specific  or 
shared  FoxAl  recruitment  sites  than  on  LNCaP-specific  ones 
(Figure  4H).  H3K4me2  levels  were  also  significantly  higher  on  re¬ 
gions  with  FoxAl  recognition  motifs  bound  by  FoxAl  compared 
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Figure  3.  FoxAl  Cell  Type-Specific  Binding  Sites  Also  Recruit  Nuclear  Receptors  ERa  or  AR  and  Correlate  with  Regulation  of  Sex  Steroid 
Signaling  in  Breast  and  Prostate  Cancer  Ceils 

(A)  Enrichment  for  the  ERE,  ERE  half-site,  FKHR,  ARE,  and  ARE  half-site  in  the  center  of  the  binding  sites  specific  to  MCF7  cells  (MCF7-only)  or  LNCaP  cells 
(LNCaP-only)  or  shared  between  the  two  cell  lines  (Both).  The  occurrence  of  the  motifs  (N  motifs)  was  normalized  to  the  number  of  sites  in  each  subset  (N  binding 
sites). 

(B)  Venn  diagrams  depicting  the  overlap  between  FoxAl  (red)  and  ERa  (blue)  binding  sites  from  MCF7  cells  together  with  FoxAl  (green)  and  AR  (orange)  binding 
sites  from  LNCaP  cells. 

(C)  Correlation  between  E2  or  DHT  regulated  genes  and  binding  sites  for  FoxAl  and  ERa  in  MCF7  cells  or  for  FoxAl  and  AR  in  LNCaP  cells.  Analyses  were 
performed  as  in  Figure  IB  using  hormone-regulated  or  -nonregulated  genes  from  chromosomes  8,  11,  and  12.  Fold  change  is  presented  for  instances  where 
significant  differences  are  observed  between  regulated  and  nonregulated  genes. 
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Figure  4.  Methylation  Pattern  of  Histone  H3  Lysine  4  Correlates  with  Cell  Type>Specific  FoxA1  Recruitment 

(A)  De  novo  determination  of  the  sequence  recognized  by  FoxA1  within  its  cell  type-specific  or  shared  binding  sites.  Logos  show  the  consensus  sequences  of  the 
enriched  Forkhead  motifs  found  by  de  novo  analyses  within  the  FoxA1  -binding  sites  specific  to  MCF7  (MCF7-only)  or  LNCaP  (LNCaP-only)  cells  or  common  to 
the  two  cell  lines  (Both)  in  comparison  to  the  Transfac  FoxA1  matrix  (http://www.gene-regulation.eom/pub/databases.html#transfac). 

(B-G)  Levels  of  H3K9me2  (B  and  C),  H3K4me1  (D  and  E),  and  H3K4me2  (F  and  G)  on  FoxAl  recruitment  sites  specific  to  MCF7  cells  (MCF7-only)  or  LNCaP  cells 
(LNCaP-only)  or  shared  between  the  two  cell  lines  (Both)  were  determined  by  ChIP-qPCR.  Box  plots  were  generated  from  data  obtained  from  three  independent  ex¬ 
periment  testing  1 1  sites  specific  to  MCF7  cells,  1 2  to  LNCaP  cells,  and  8  common  to  both  cell  types.  Statistical  analyses  of  the  difference  between  the  non-cell  type- 
specific  sites  and  the  other  sites  are  presented,  *:  p  <  0.05  and  **:  p  <  0.01 .  Whiskers  correspond  to  the  largest  and  smallest  nonoutlier  values  from  each  dataset. 
(H)  ChIP-chip  analyses  of  H3K4me2  levels  across  chromosomes  8, 11 ,  and  12  in  MCF7  cells.  Two  independent  ChIP-chip  experiments  were  combined  and  an¬ 
alyzed  using  the  MAT  algorithm.  The  signals  given  by  the  probes  localized  in  the  200  bp  central  regions  of  the  FoxAl  -binding  sites  unique  to  MCF7  (MCF7-only)  or 
LNCaP  (LNCaP-only)  or  shared  (Both)  by  the  two  cell  lines  were  compared  (left  graph).  Similarly,  H3K4me2  levels  at  200  bp  regions  containing  the  FoxAl  rec¬ 
ognition  motif  bound  by  FoxAl  were  compared  to  randomly  selected  FoxAl  -unbound  FoxAl  recognition  motif-containing  regions  (right  graph).  Means  ±  SEM  of 
H3K4me2  levels  given  by  MAT  are  shown  as  well  as  statistically  significant  differences  with  ***  corresponding  to  p  <  0.001 . 
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to  an  equivalent  number  of  randomly  selected  unbound  regions 
with  FoxA1  recognition  motifs  in  MCF7  cells  (Figure  4H).  Impor¬ 
tantly  as  less  than  3.7%  of  sites  harboring  FoxA1  recognition 
motifs  actually  recruit  FoxA1  in  MCF7  cells  (Figure  S14C),  these 
data  derived  from  the  analysis  of  thousands  of  sites  reveal 
a  strong  correlation  between  the  presence  of  FI3K4me2  and 
FoxA1  binding.  Of  the  FoxA1  recruitment  sites  tested,  as  ex¬ 
pected,  very  few  demonstrated  enrichment  for  FI3K4me3  in  ac¬ 
cordance  with  the  predominant  occurrence  of  this  modification 
at  promoters  rather  than  enhancers  (Heintzman  et  al.,  2007) 
(Figure  S14B).  Overall,  these  results  suggest  a  link  between 
FoxA1  recruitment  with  the  presence  of  FI3K4me1  and  me2. 

FoxA1  Is  Required  for  Chromatin  Remodeling 
but  Not  for  H3K4  Methylation 

In  MCF7  cells,  H3K4me1  and  me2  are  detected  at  enhancers 
prior  to  E2  stimulation  and  ERa  binding,  reminiscent  of  FoxA1  re¬ 
cruitment  (Figure  S15).  Accordingly,  ERa  silencing  in  these  cells 
did  not  dramatically  affect  H3K4  methylation  levels  or  FoxA1  re¬ 
cruitment  at  most  sites  where  these  two  factors  are  recruited  (Fig¬ 
ures  5A  and  S16).  Moreover,  the  vast  majority  (~80%)  of  FoxAf 
sites  specific  to  MCF7  cells  do  not  recruit  ERa  (Figure  3B).  Hence, 
while  we  cannot  entirely  rule  out  a  potential  role  for  ERa  in  stabi¬ 
lizing  FoxA1  binding  at  a  small  subset  of  sites,  these  results  sug¬ 
gest  that  in  general  cell-specific  FoxA1  recruitment  occurs  inde¬ 
pendently  of  ERa  action  in  MCF7  cells.  This  raises  the  issue  of 
whether  H3K4me1  and  me2  are  required  for  FoxA1  recruitment 
or  are  induced  as  a  result  of  FoxA1  binding  to  the  chromatin.  This 
question  was  first  addressed  by  investigating  whether  FoxAt 
silencing  would  affect  H3K4  methylation,  chromatin  remodeling, 
or  both  in  MCF7  and  LNCaP  cells.  Consistent  with  its  cel  I  type-spe¬ 
cific  recruitment,  FoxAt  silencing  impacted  the  DNase  I  sensitivity 
only  at  those  sites  to  which  it  was  recruited  (Figure  5B).  Under  these 
conditions,  however,  these  sites  did  not  in  general  show  a  signifi¬ 
cant  reduction  in  the  levels  of  H3K4me1  or  me2  in  either  MCF7 
or  LNCaP  cells  (Figure  5C).  In  fact,  a  significant  increase  in 
H3K4me1  was  detectable  at  most  sites  tested  in  LNCaP  cells.  Sim¬ 
ilarly,  levels  of  H3K9me2  were  unaffected  by  FoxAt  silencing  (Fig¬ 
ure  SI  7).  Overall,  these  data  do  not  favor  a  model  whereby  FoxAt 
recruitment  leads  to  the  induction  of  these  modifications  but  rather 
suggest  an  important  contribution  of  FoxAt  in  opening  genomic 
regions  marked  by  H3K4met  and  me2.  Accordingly,  even  though 
FoxAt  silencing  did  not  modulate  H3K4  methylation  levels  at 
enhancers  (Figure  5D),  it  affected  the  transcriptional  regulation  of 
their  target  genes  (Figures  5E  and  St  8).  Considering  that 
H3K4me2  is  typically  associated  with  gene  transcription  (Bernstein 
et  al.,  2005),  these  results  highlight  the  critical  interplay  between 
the  pioneer  factor  FoxAt  and  H3K4me2  at  enhancers  for  efficient 
gene  regulation. 

Reduction  of  H3K4  Methylation  Impairs 
Cell  Type-Specific  FoxAl  Recruitment 

To  establish  the  capacity  of  H3K4  mono-  or  dimethylation  to 
define  the  cell  type-specific  recruitment  of  FoxAt ,  we  overex¬ 
pressed  the  H3K4met  and  me2  specific  demethylase  KDMt 
(also  known  as  LSDt/BHCttO)  in  MCF7  cells  and  established 
its  impact  on  FoxAt  recruitment  (Shi  et  al.,  2004).  Under  these 
conditions,  H3K4met  was  slightly  reduced  (Figure  St9A)  and 


H3K4me2  was  significantly  lowered  on  FoxAt -binding  sites  (Fig¬ 
ure  6A).  The  level  of  H3K9me2  remained  unchanged  at  these 
sites  (Figure  6C).  Although  FoxAt  protein  levels  were  unaffected 
by  KDMt  overexpression  (Figure  6D),  its  recruitment  to  the  chro¬ 
matin  was  significantly  impaired  (Figure  6B).  Importantly,  no 
global  alteration  in  ChIP  efficiency  was  observed  upon  KDMt 
overexpression  (Figures  S20B  and  S20C).  Hence,  these  re¬ 
sults  suggest  that  H3K4me2  is  required  to  define  the  cell  type- 
specific  regions  competent  for  recruitment  of  FoxAt.  The 
correlation  between  the  presence  of  histone  marks  and  FoxAt , 
ERa,  or  AR  recruitment  is  shown  for  specific  examples  of 
hormone-regulated  genes  (Figure  6E). 

DISCUSSION 

Networks  of  transcription  factors  are  known  to  be  at  the  center  of 
cell  type-specific  transcriptional  programs  that  characterize  dif¬ 
ferent  cell  lineages  (Olson,  2006;  Schrem  et  al.,  2002).  However, 
how  a  particular  transcription  factor  manages  to  regulate  gene 
expression  in  a  cell  type-specific  fashion  within  the  context  of 
different  transcription  factor  networks  is  still  poorly  understood. 
In  particular,  it  is  still  elusive  how  a  pioneer  factor,  such  as  FoxAt , 
that  is  able  to  bind  condensed  chromatin  structures  in  vitro  can 
mediate  differential  gene  regulation  in  vivo  (Cirillo  et  al.,  2002; 
Eeckhoute  et  al.,  2006).  Here,  we  show  that  FoxAt  differential 
transcriptional  activities  in  breast  and  prostate  cells  relies  pri¬ 
marily  on  its  differential  recruitment  to  the  chromatin  and  alterna¬ 
tive  collaboration  with  the  lineage-specific  factors  ERa  or  AR  at 
cell-specific  enhancers  (Figures  6E,  7,  and  S2t).  These  findings 
indicate  that  alternative  transcriptional  programs  depend  both 
on  the  orchestrated  expression  of  a  particular  set  of  collaborat¬ 
ing  transcription  factors  together  with  their  ability  to  bind  cell- 
specific  enhancer  elements  in  the  vicinity  of  their  target  genes. 
Alternatively,  other  transcription  factor  networks  may  primarily 
target  gene  promoters  (Bieda  et  al.,  2006;  Geles  et  al.,  2006). 
This  may  allow  for  a  tight  regulation  of  gene  expression  both  at 
basal  levels  and  in  response  to  stimuli  through  combined  activ¬ 
ities  of  promoter-  and  enhancer-bound  regulatory  complexes 
(Hatzis  and  Talianidis,  2002;  Marr  et  al.,  2006).  Importantly,  we 
found  that  even  ubiquitous  transcription  factors,  such  as  AP-1 , 
show  differential  recruitment  to  cell  type-specific  enhancers. 
Combined  with  other  recent  studies  (So  et  al.,  2007),  this  sug¬ 
gests  that  cell-specific  binding  to  the  chromatin  represents 
a  general  mechanism  for  differential  transcription  factor  regula¬ 
tory  activities.  Cell-specific  recruitment  of  AP-1  to  FoxAl  sites 
could  have  important  functional  implications  in  breast  cells  espe¬ 
cially  for  E2  downregulated  genes  where  FoxAl  -binding  sites  are 
enriched  for  AP-1  and  Spl  motifs  (p  <  0.05)  that  can  tether  ERa 
to  mediate  gene  repression  (Carroll  et  al.,  2006;  Stossi  et  al., 
2006).  Other  important  candidates  for  a  global  role  in  control 
of  sex  steroid  signaling  through  collaborations  with  FoxAl  and 
ERa  or  AR  include  CATA  family  members  (Eeckhoute  et  al., 
2007;  Wang  et  al.,  2007),  c-myc  (Cheng  et  al.,  2006),  and  NFIC 
(Eeckhoute  et  al.,  2006). 

The  occurrence  of  specific  histone  modifications  at  c/s-regula- 
tory  elements  commonly  characterizes  transcriptionally  active  or 
inactive  regions  (Bernstein  et  al.,  2007;  Kouzarides,  2007).  Re¬ 
cently,  the  balance  between  the  presence  of  active  or  repressive 
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Figure  5.  FoxA1  Silencing  Decreases  Chromatin  Accessibility  of  Enhancers  but  Not  H3K4  Methylation  Levels 

(A)  Effect  of  ERa  silencing  on  FoxA1  recruitment.  Eight  sites  recruiting  both  ERa  and  FoxA1  in  MCF7  cells  were  used  to  monitor  the  effect  of  ERa  silencing  on  ERa 
and  FoxA1  recruitment  by  ChlP-qPCR.  Reduction  in  ERa  protein  levels  by  slERa  was  also  demonstrated  by  western  blot  (Figure  S16A). 

(B)  DNasel  sensitivity  assays  were  performed  in  both  MCF7  and  LNCaP  cells,  and  the  percent  change  triggered  by  FoxAl  silencing  from  at  least  three  indepen¬ 
dent  experiments  is  reported.  Data  are  means  ±  standard  deviation  (SD). 

(C)  Effect  of  FoxAl  silencing  on  the  levels  of  H3K4me1  and  me2  at  binding  sites  used  in  the  DNasel  sensitivity  assays  in  both  MCF7  and  LNCaP  cells  from  three 
experiments  is  presented,  *:  p  <  0.05  and  **:  p  <  0.01.  Data  are  means  ±  SD. 

(D  and  E)  Presence  of  H3K4me1/2  at  enhancer  is  not  sufficient  for  transcriptional  regulation  of  BIK  and  CCNDl  in  MCF7  cells.  H3K4me1/2  levels  at  FoxAl  re¬ 
cruiting  enhancers  localized  within  or  nearby  FoxAl  target  genes  were  determined  by  ChIP-qPCR  in  l\/1CF7  cells  transfected  with  siLuc  or  siFoxAl  (D).  Even 
though  FoxAl  silencing  did  not  modulate  the  levels  of  H3K4  methylation,  the  expression  of  the  target  genes  was  significantly  reduced  (E).  Data  are  means  ±  SD. 
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Figure  6.  Role  of  H3K4me2  in  FoxAl  Recruitment  to  the  Chromatin 

(A-C)  Effect  of  KDM1  overexpression  on  H3K4  methylation  (A),  FoxAl  recruitment  (B),  and  H3K9  methylation  (C).  H3K4me2  and  H3K9me2  levels  as  well  as  FoxAl 
recruitment  were  determined  in  control  or  KDM1 -overexpressing  cells  by  ChIP-qPCR.  Box  plots  were  generated  from  data  obtained  for  16  sites.  Results  from  one 
representative  experiment  are  presented  with  the  statistical  analyses  of  the  difference  between  control  and  KDMI  -overexpressing  cells,  **:  p  <  0.01 .  Whiskers 
correspond  to  the  largest  and  smallest  nonoutlier  values  from  each  dataset. 

(D)  Western  blots  showing  KDMI ,  FoxAl ,  and  Calnexin  (Control)  levels  in  MCF7  cells  transfected  with  an  empty  control  plasmid  or  a  plasmid  coding  for  KDMI . 

(E)  Specific  examples  of  genes  regulated  by  E2,  DHT,  or  both  hormones.  One  gene  specifically  regulated  by  E2  in  MCF7  cells  (MCF7-only),  by  DHT  in  LNCaP  cells 
(LNCaP-only),  and  by  both  hormones  in  MCF7  and  LNCaP  cells,  respectively  (both),  is  shown.  E2-  and  DHT-regulated  genes  were  identified  using  expression 
array  analyses  performed  in  MCF7  and  LNCaP  cells,  respectively.  Significantly  regulated  genes  were  determined  using  a  t  test  and  a  p  value  cut-off  of  5  x  1 0“^. 
ERa-,  AR-,  and  FoxAl  -binding  sites  from  ChIP-chip  are  indicated  together  with  the  occurrence  of  histone  modifications  derived  from  ChIP-qPCR  at  these  sites. 
Enrichment  for  the  various  factors  is  presented  by  green  and  red  blocks  in  LNCaP  and  MCF7  cells,  respectively.  White  blocks  indicate  the  absence  of  enrichment 
for  the  ChIPed  factors  or  a  decrease  of  more  than  2-fold  for  histone  marks  in  MCF7  cells  following  KDMI  overexpression.  A  4  kb  wide  view  of  the  probe  signals 
obtained  by  ChIP-chip  for  FoxAl ,  ERa,  and  AR  at  the  analyzed  binding  sites  is  also  shown.  Complete  probe  signal  across  the  three  genes  selected  is  presented  in 
Figure  S21. 
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Figure  7.  Model  of  the  Cell  Type-Specific  Interplay  between  the 
Epigenetic  Signature  and  FoxA1  for  the  Establishment  of  Lineage- 
Specific  Transcriptional  Programs 

Schematic  representation  of  how  FoxA1  recruitment  occurs  primarily  on 
H3K9me2-poor  but  H3K4me1/2-rich  regions.  H3K4me1/2  could  guide 
FoxAl  cell  type-specific  recruitment  through  direct  physical  interactions. 
FoxAl  regulation  of  differential  transcriptional  programs  is  subsequently 
achieved  through  transcriptional  collaborations  with  cell  type-specific  (ERa 
and  AR)  as  well  as  ubiquitously  expressed  (AP-1)  transcription  factors. 


histone  modifications  (trimethyiation  of  H3K4  and  H3K27)  has 
been  shown  to  correiate  with  promoter  activity  (Azuara  et  ai., 
2006;  Bernstein  et  ai.,  2006;  Mikkeisen  et  ai.,  2007).  Here,  we 
show  that  the  cell  type-specific  activity  of  enhancers  correlates 
with  the  presence  of  the  positive  mark  H3K4me2,  previously 
shown  to  be  distributed  in  a  cell  type-specific  manner  (Bernstein 
et  ai.,  2005),  while  inactive  enhancers  lack  H3K4me2  and  harbor 
higher  levels  of  the  repressive  mark  H3K9me2.  Interestingly, 
even  though  FoxAl  silencing  does  not  modulate  levels  of  H3K4 
and  K9  methylation  at  enhancers  (Figures  5  and  S1 7),  it  is  required 
for  their  activity  and  consequently  for  their  target  gene  transcrip¬ 
tional  regulation  (Figures  5,  S6,  and  S18).  Therefore,  H3K4me1/2 
appearto  correlate  with  competent  enhancers  but  not  necessarily 
with  transcriptional  activation  of  target  genes  that  requires  factors 
such  as  FoxAl  to  activate  the  functionality  of  these  enhancers. 

The  capacity  of  FoxAl  to  bind  unique  binding  sites  in  reconsti¬ 
tuted  chromatin  has  been  studied  extensively  in  vitro  (Cirillo  et  ai.. 


1 998,  2002;  Sekiya  and  Zaret,  2007).  Under  these  conditions,  no 
histone  modifications  appear  to  be  required  for  FoxAt  recruit¬ 
ment.  However,  our  results  demonstrate  that  in  vivo  FoxAl  actu¬ 
ally  occupies  only  a  very  small  fraction  of  all  its  potential  recogni¬ 
tion  motifs  found  in  the  genome  (less  than  3.7%).  Moreover,  this 
limited  number  of  occupied  sites  is  significantly  different  be¬ 
tween  two  different  cell  types.  Therefore,  although  FoxAt  can 
act  as  a  pioneer  factor  able  to  bind  to  condensed  chromatin, 
we  show  here  that  in  vivo  its  pioneer  function  is  limited  to  a  small 
subset  of  sites  that  are  largely  cell  type  specific.  Our  data  further 
define  on  a  genomic  scale  the  chromatin  components  involved  in 
directing  FoxAl  recruitment  to  this  subset  of  its  potential  binding 
sites.  Indeed,  our  results  point  to  an  important  role  of  active  and 
repressive  histone  marks,  notably  H3K4me2  and  H3K9me2,  re¬ 
spectively,  in  guiding  FoxAl  recruitment.  These  data  indicate 
that  a  better  understanding  of  cell-lineage  transcriptional  com¬ 
mitment  will  require  the  study  of  how  these  marks  are  established 
and  how  they  regulate  recruitment  of  pioneer  transcription  fac¬ 
tors  such  as  FoxAt .  Altogether,  our  data  reveal  an  additional 
layer  of  complexity  in  the  regulation  of  FoxAl  recruitment  to 
chromatin  in  vivo  that  goes  beyond  the  mere  presence  of  its  re¬ 
cognition  motif.  Indeed,  FoxAl  translates  an  epigenetic  sig¬ 
nature  into  functional  cell  type-specific  enhancers  leading  to  the 
establishment  of  cell  type-specific  transcriptional  programs. 

EXPERIMENTAL  PROCEDURES 

ChIP-chip  and  ChIP-qPCR 

ChIP-chip  experiments  using  Affymetrix  Human  Tiling  2. OR  Array  Set  were  per¬ 
formed  as  previously  described  (Carroll  et  al.,  2005, 2006).  For  each  ChIP-chip 
experiment,  at  least  three  independent  assays  were  performed.  Analyses  were 
performed  using  MAT  (Johnson  et  at,  2006),  whose  probe  mapping  had  been 
updated  to  the  latest  human  genomic  sequence  (Hg18).  We  used  statistical 
FDR  as  cut-off  in  those  analyses.  All  ChIP-chip  data  used  in  this  study  can 
be  accessed  at  http://research.dfci.harvard.edu/brownlab/datasets/.  ChlP- 
qPCR  experiments  were  performed  as  in  Carroll  et  al.  (2005).  Statistical  anal¬ 
yses  were  performed  using  Student’s  t  test  comparison  for  unpaired  data. 
Primer  sequences  can  be  found  in  Table  St . 

Antibodies  used  for  ChIP  experiments  were  FoxAl  (Ab5089  and  Ab23738 
from  Abeam,  FOX1  from  CeMines),  ERa  (Ab-10  from  Neomarkers,  HC-20 
from  Santa  Cruz),  pan-jun  (D  from  Santa  Cruz),  pan-fos  (K-25  from  Santa 
Cruz)  (Schwartz  et  al.,  2007),  AR  (N20  from  Santa  Cruz),  H3K4me1 ,  me2,  me3, 
H3K9me1,  me2,  me3,  H4K20me1,  me2,  me3  (Ab8895,  Ab7766,  Ab8580, 
Ab9045,  Ab1220,  Ab8898,  Ab9051,  Ab9052,  and  Ab9053,  respectively,  from 
Abeam)  (Mikkeisen  et  at,  2007;  Barski  et  at,  2007),  H3K27me1,  me2,  me3 
(07-448,  07-449,  and  07-452  from  Upstate  Biotechnology  Inc.)  (Barski  et  al., 
2007;  Mikkeisen  et  at,  2007;  Vakoc  et  at,  2006),  RNA  Polll  (H-224  from  Santa 
Cruz  and  Ab5408  from  Abeam),  H3  (Ab1791  from  Abeam),  and  AcH4  (from 
Cell  Signaling). 

Genomic  Distribution  and  Binding  Site  Overlap 

Genomic  distribution  of  binding  sites  identified  by  ChIP-chip  was  performed 
using  c/s-regulatory  element  annotation  system  (CEAS)  (Ji  et  at,  2006).  Two 
binding  sites  were  considered  to  overlap  as  long  as  they  had  one  base  pair 
in  common.  The  average  size  of  the  ChIP-chip  regions  being  1  kb,  this  means 
that  the  center  of  the  two  binding  sites  had  to  be  in  average  within  1  kb  of  each 
other  to  be  considered  overlapping. 

Transcription  Factor  Recognition  Motif  Enrichment  Analysis 

Known  DNA  motifs  that  are  enriched  relative  to  the  center  of  ChIP-chip  sites 
were  identified  using  the  following  statistic.  All  sites  were  trimmed  or  expanded 
to  600  bp  centered  at  the  middle  point  of  the  identified  ChIP-enriched  regions. 
All  subsequences  within  the  trimmed  regions  were  scored  by  a  TRANSPAC 
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motif  (Matys  et  al.,  2006)  and  the  genomic  background  sequence  composition 
to  identify  hits  above  certain  relative  entropy  cutoff  t.  Letting  x,-,  a  value 
between  0  and  1 ,  denote  the  relative  location  of  motif  hit  /  on  the  ChIP  regions 
(0  and  1  representing  the  center  and  edge  of  a  ChIP  region,  respectively),  out  of 
N  total  motif  hits,  we  define  a  z  score,  z=  —  0.5)/'\/A/7T2  to  assess 

the  positional  bias  of  a  motif  toward  the  centers  of  the  regions.  Different  integer 
cutoffs  f  >  3  were  tested  for  each  motif,  and  the  cutoff  resulting  in  the  highest  z 
was  selected.  This  statistic  is  based  on  the  assumptions  that  insignificant  DNA 
motifs  will  be  uniformly  distributed  across  the  ChIP  regions  and  the  null  distri¬ 
bution  of  Y^Xi  can  be  estimated  as  the  N-fold  convolution  of  uniform  density 
functions.  In  Figure  3A,  a  Gaussian  kernel  was  used  to  smooth  the  curves  in 
case  too  few  motif  hits  appeared  at  particular  positions. 

Association  of  Trends  in  Gene  Expression 
with  Transcription  Factor  Binding  Sites 

Gene  expression  data  were  normalized  and  summarized  using  RMA  (Irizarry 
et  al.,  2003)  and  updated  RefSeq  probeset  definitions  (Dai  et  al.,  2005).  Where 
multiple  transcripts  were  associated  with  a  single  gene  expression  index  the 
transcript  with  the  TSS  closest  to  a  ChIP-enriched  region  was  selected.  "Dif¬ 
ferentially  expressed”  genes  were  denoted  as  those  genes  with  a  t  test  p  value 
<  10"®.  Genes  “close”  to  a  ChIP  region  were  defined  as  those  having  such 
a  region  within  20  kb  of  the  TSS.  Fisher’s  exact  test  was  used  to  assess  the 
statistical  significance  of  the  association  between  close  genes  and  differen¬ 
tially  expressed  genes. 

De  Novo  Motif  Search 

De  novo  motif  searches  were  performed  on  sequences  ±100  bp  from  the 
centers  of  FoxAl  ChIP  regions  in  MCF7  cells  or  LNCaP  cells  by  using  LeitMotif 
(J.  Song  and  X.S.L.,  unpublished  data),  a  modified  MDscan  (Liu  et  al.,  2002) 
with  ninth-order  Markov  dependency  for  the  genome  background.  Motif  logos 
were  generated  by  enoLOGOS  (Workman  et  al.,  2005). 

RNA  Interference 

FoxAl  was  silenced  using  the  following  small-interfering  RNA  duplexes: 
siFoxAl  #1  sense  5'-GAGAGAAAAAAUCAACAGC-3':  antisense  5'-GCUGUU 
GAUUUUUUCUCUC-3'  (Carroll  et  al.,  2005;  Eeckhoute  et  al.,  2006)  and 
siFoxAl  #2  5'-GGACUUCAAGGCAUACGAAUU-3':  5'-UUCGUAUGCCUUGA 
AGUCCUU-3'  (Figure  S17).  SMARTpool  siRNA  directed  against  ERa  was 
purchased  from  Dharmacon.  Small-interfering  RNA  against  Luciferase  was 
used  as  a  negative  control  (Carroll  et  al.,  2005). 

DNase  I  Hypersensitivity  Assays 

DNase  I  hypersensitivity  assays  were  performed  as  in  Eeckhoute  et  al.  (2006). 

KDM1  Overexpression  Experiments 

A  total  of  15  |.ig  of  pCMX-KDMI  construct  or  the  control  empty  vector  were 
transfected  in  MCF7  cells  using  lipofectamine  2000  (Invitrogen)  according 
to  the  manufacturer’s  instructions.  After  76  hr  of  expression,  cells  were 
processed  for  ChIP-qPCR  as  previously  described. 

Real-Time  RT-PCR 

RNA  was  isolated  from  MCF7  and  LNCaP  cells  using  RNeasy  mini  kit  (QIA- 
GEN),  with  on-column  DNase  treatment  to  remove  contaminating  genomic 
DNA.  Real-time  reverse  transcription-PCR  (RT-PCR)  was  done  as  in  Keeton 
and  Brown  (2005).  Primers  used  in  RT-qPCR  are  listed  in  Table  S2. 

Western  Blots 

Western  blots  were  processed  as  described  in  Lupien  et  al.  (2007)  using  anti¬ 
bodies  against  KDM1  kindly  provided  by  R.  Schule  (Universitats-Frauenklinik 
und  Zentrum  fur  Klinische  Forschung,  Freiburg,  Germany),  FoxAl  (Abeam), 
and  Calnexin  (Stressgen  Biotechnologies). 


SUPPLEMENTAL  DATA 

Supplemental  Data  include  twenty-one  figures,  two  tables,  and  Supplemental 
References  and  can  be  found  with  this  article  online  at  http://www.cell.com/ 
cgi/content/full/1 32/6/958/DC1  /. 
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Proper  activation  of  transcriptional  networks  in  complex  organisms  is  central  to  the  response  to  stimuli.  We 
demonstrate  that  the  selective  activation  of  a  subset  of  the  estrogen  receptor  alpha  (ERa)  clstrome  in  MCF7 
breast  cancer  cells  provides  specificity  to  the  estradiol  (E2)  response.  ERa-speciflc  enhancers  that  are  subject 
to  E2-induced  coactivator-associated  arginine  methyltransferase  1  (CARMl)  action  are  critical  to  E2-stimu- 
lated  gene  expression.  This  is  true  for  both  FoxAl-dependent  and  independent  enhancers.  In  contrast,  a  subset 
of  E2-suppressed  genes  are  controlled  by  FoxAl-independent  ERa  binding  sites.  Nonetheless,  these  are  sites 
of  E2-induced  CARMl  activity.  In  addition,  the  MCF7  RNA  polymerase  II  cistrome  reveals  preferential 
occupancy  of  E2-regulated  promoters  prior  to  stimulation.  Interestingly,  E2-suppressed  genes  tend  to  lie  in 
otherwise  silent  genomic  regions.  Together,  our  results  suggest  that  the  transcriptional  response  to  E2  in 
breast  cancer  cells  is  dependent  on  the  interplay  between  polymerase  II  pre-occupied  promoters  and  the  subset 
of  the  ERa  cistrome  associated  with  coactivation. 


The  transcriptional  response  to  estrogen  in  numerous  tis¬ 
sues,  including  mammary  gland,  bone,  and  uterine  tissues,  and 
in  diseases  such  as  breast  cancer  is  dependent  on  estrogen 
receptor  alpha  (ERa).  Genome-wide  positional  analyses  de¬ 
fining  the  set  of  cw-regulatory  elements  recruiting  ERa,  known 
as  its  cistrome,  in  breast  cancer  cells  have  revealed  its  predom¬ 
inant  recruitment  to  enhancers  as  opposed  to  promoter  re¬ 
gions  (6,  7,  37,  39).  As  for  many  other  transcription  factors, 
genomic  recruitment  of  ERa  is  restricted  to  a  small  proportion 
of  its  putative  binding  sites  (<4.4%)  offering  a  primary  means 
of  defining  the  response  to  estradiol  (E2)  (5,  7,  37).  Similarly, 
the  promoter  predominant  Pol  II  recruitment  in  breast  cancer 
cells  is  restricted  to  a  subset  of  promoters  upon  E2  stimulation 
(7,  32,  33,  35).  Epigenetic  modifications  are  central  to  the 
lineage-specific  recruitment  at  enhancers  and  promoter  re¬ 
gions.  Indeed,  promoters  of  activated  genes  harbor  trimethyl- 
ated  histone  H3  lysine  4  (H3K4me3)  favoring  the  recruitment 
of  chromatin  remodeling  enzymes  and  histone  acetylases  (1, 
18,  42,  52,  56,  58).  In  contrast,  promoters  associated  with  tran¬ 
scriptional  repression  harbor  trimethylated  H3K27  (H3K27me3) 
(1,  3,  36,  42).  Similarly,  functional  enhancers  are  associated  with 
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mono-  and  dimethylation  of  H3K4  (H3K4mel,  me2)  restricting 
the  recruitment  and  the  chromatin  remodeling  activity  of  the 
pioneer  factor  FoxAl,  required  for  ERa  binding,  in  a  lineage- 
dependent  manner,  while  levels  of  H3K9me2  are  elevated  on 
nonfunctional  enhancers  (15,  25,  40). 

Despite  these  epigenetic  constraints,  RNA  polymerase  II 
(Pol  II)  and  ERa  together  are  recruited  to  more  than  9,000 
independent  high-confidence  (false  discovery  rate  [FDR],  1%) 
sites  across  the  genome  of  breast  cancer  cells  upon  E2  stimu¬ 
lation  (7).  Studies  limited  to  a  small  number  of  ERa  target 
sites  have  implicated  coactivators,  such  as  the  coactivator  as¬ 
sociated  arginine  methyltransferase  1  (CARMl),  in  the  E2 
response  (22).  As  they  are  recruited  to  ERa  binding  sites, 
coactivators  allow  for  a  series  of  posttranslational  modifica¬ 
tions  on  histones  and  other  coactivator  proteins  in  order  to 
facilitate  chromatin  remodeling  and  cycling  of  the  transcrip¬ 
tional  unit  essential  for  the  E2  response  (41,  57).  In  the  case  of 
CARMl,  this  involves  dimethylation  of  arginine  residues  on 
histone  H3  as  well  as  on  the  coactivator  AIBl  (8,  48).  In 
addition,  recent  studies  in  Drosophila  have  revealed  the  dom¬ 
inant  presence  of  poised  Pol  II  at  promoters  of  genes  involved 
in  the  response  to  stimuli  and  developmental  signals  (47,  69). 
In  the  present  study,  we  investigated  the  impact  of  CARMl 
coactivator’s  activity  on  ERa  binding  sites  and  of  Pol  II  at 
promoters  in  the  transcriptional  response  to  E2  through  ge¬ 
nome-wide  positional  analyses  in  human  breast  cancer  cells. 

MATERIALS  AND  METHODS 

ChIP-microarray  preparation.  Cells  were  hormone  deprived  for  3  days  in 
phenol  red-free  medium  (Invitrogen)  supplemented  with  10%  charcoal  dextran- 
treated  fetal  bovine  serum.  Cells  were  stimulated  with  the  estrogen  17|3-estradiol 
(10’'®  M)  for  45  min  and  cross-linked  by  using  1%  formaldehyde.  Samples  were 
sonicated  {Fisher  Sonic  Desmembrator,  model  500)  and  immunoprecipitated,  as 
previously  described  (40),  using  an  antibody  against  histone  H3  arginine  17 
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FIG.  1.  Establishing  classes  of  enhancer-rich  clusters  under  E2  treatment.  (A)  Cluster  analysis  according  to  the  binding  activity  for  the 
transcription  factor  ERa,  the  pioneer  factor  FoxAl,  the  mark  of  CARMl  activity  (an  antibody  raised  against  dimethylation  of  arginine  17  on 
histone  H3),  and  Pol  II  across  the  25,416  high-confidence  regions  recruiting  at  least  one  factor  from  all  analyzed  cistromes  established  through 
unbiased  genome-wide  ChIP-on-ChIP  in  MCF7  breast  cancer  cells  (E2,  E2  treated  for  45  min;  O,  vehicle  treated).  (B)  Genomic  distribution  of 
binding  sites  found  in  each  cluster  with  regard  to  the  TSS  of  known  genes  using  the  cw-regulatory  element  annotation  system  (28).  (C)  Average 
MAT  scores  of  ERa  and  FoxAl  and  the  difference  in  CARMl  activity  between  E2-treated  and  control  MCF7  cells  in  each  cluster.  The  average 
MAT  score  signal  for  ERa  or  FoxAl  for  the  various  clusters  significantly  different  from  a  1.5  average  MAT  score  is  presented.  Similarly,  the 
average  change  in  CARMl  activity  MAT  score  significantly  different  from  1  between  E2-  and  vehicle-treated  cells  is  presented.  *,  P  <  0.05;  **, 
P  <  0.01;  ***,  P  <  0.001.  (D)  Half-ERE  and  Forkhead  (FKH)  motif  enrichment  in  sites  from  each  cluster. 


dimethylated  (H3R17me2;  Upstate  Biotechnology,  07-214)  and  Pol  II  (Abeam, 
4H8;  Santa  Cruz  Biotechnology,  H-224).  Purified  samples  were  labeled  as  pre¬ 
viously  described  (6).  The  microarrays  used  were  Affymetrix  GeneChip  Human 
Tiling  2.0R  Ai'ray  Sets.  Genome-wide  chromatin  immunoprecipitation  (ChlP)- 
on-ChlP  analysis  was  conducted  by  using  a  model-based  analysis  of  tiling-arrays 


program  (MAT)  (30).  All  ChIP-on-ChIP  data  used  in  the  present  study  can  be 
accessed  at  http://research.dfci.harvard.edu/brownlab/datasets/. 

Cluster  analysis.  We  generated  a  set  of  genomic  intervals  derived  from  the 
union  of  all  high-confidence  sites  associated  with  either  ERa,  FoxAl,  or  CARMl 
activity  or  Pol  II.  Next,  we  assigned  the  score  to  each  interval  for  each  factor  as 
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FIG.  2.  E2-induced  CARMl  activity  at  ERa  sites  associates  with  activating  events.  (A)  Level  of  recruitment  for  the  coactivators  p300  and  SRCl 
under  vehicle  (O)  or  E2  treatment  established  by  ChIP-qPCR  on  eight  ERa  sites  associated  and  eight  not  associated  with  CARMl  activation  in 
MCF7  breast  cancer  cells.  (B)  Levels  of  histone  modifications,  namely,  H3K18ac,  H3K27ac,  and  H4K12ac,  were  determined  as  in  panel  A. 
(C)  Impact  of  E2  treatment  on  nucleosome  density.  The  changes  in  occupancy  of  the  core  histone  H3  were  determined  by  ChIP-qPCR  as  in  panel 
A.  Alterations  to  the  DNA  accessibility  were  determined  by  using  FAIRE  (21).  The  results  are  derived  from  a  minimum  of  two  independent 
experiments.  *,  P  <  0.05;  **,  P  ^  0.01;  ***,  P  ^  0.001. 


the  maximum  MAT  score  falling  within  the  interval  for  the  given  factor.  For  each 
factor  MAT  scores  were  trimmed  at  the  2.5  and  97.5  percentiles  and  scaled  to  lie 
between  0  and  1.  Genomic  regions  were  clustered  by  using  ^-means  clustering. 

Sequence  analysis.  Genome-wide  distribution  as  well  as  sequence  conserva¬ 
tion  analysis  of  H3R17me2  chip-on-chip  was  determined  by  using  cw-element 
annotation  systems  (28).  Enriched  motifs  within  clusters  as  well  as  the  associa¬ 
tions  with  gene  expression  were  analyzed  as  described  previously  (40). 

CHIP  assays.  At  2  to  3  days  before  induction,  MCF-7  cells  were  seeded  in 
phenol  red-free  Dulbecco  modified  Eagle  medium  supplemented  with  10%  char- 
coal-dextran-treated  fetal  bovine  serum  (Omega  Scientific,  Inc.,  Tarzana,  CA),  2 
mM  L-glutamine,  and  100  U  of  penicillin-streptomycin/ml  at  a  density  of  5  X  10® 
cells  per  150-mm  plates.  Cells  were  subsequently  induced  with  10“®  M  E2  for  45 
min.  Chip  experiments  were  then  performed  as  described  previously  (16).  An¬ 
tibodies  to  ERa  (Lab  Vision,  Ab-10;  Santa  Cruz  Biotechnology,  sc-543), 
H3R17me2  (Upstate  Biotechnology,  07-214),  H3K18ac  (Upstate  Biotechnology, 
07-354),  H3K27ac  (Upstate  Biotechnology,  07-360),  H4K12ac  (Upstate  Biotech¬ 
nology,  07-595),  H3  (Abeam,  abl791),  p300  (Santa  Cruz  Biotechnology,  sc-585), 
and  SRCl  (Santa  Cruz  Biotechnology,  sc-8995)  were  used  for  this  assay.  Purified 
DNA  was  used  in  quantitative  PCR  (qPCR)  analysis.  The  primers  used  in  this 
analysis  are  listed  in  Table  SI  in  the  supplemental  material.  Immunoprecipitated 


DNA  amounts  were  normalized  to  inputs  and  are  expressed  as  the  relative 
enrichment. 

FAIRE  analysis.  Formaldehyde-assisted  isolation  of  regulatory  elements 
(FAIRE)  was  performed  as  described  in  reference  21.  The  primers  used  in  this 
analysis  are  listed  in  Table  SI  in  the  supplemental  material. 


RESULTS 

Distinct  enhancer-rich  clusters  characterize  genomic  ERa 
recruitment.  In  order  to  better  characterize  the  impact  of  co¬ 
activator  action  on  the  ERa  cistrome  upon  E2  stimulation,  we 
have  established  the  relative  level  of  CARMl  activity  across 
the  genome  of  MCF7  breast  cancer  cells.  This  was  achieved 
through  Chip  studies  combined  with  whole-genome  tiling-path 
microarrays  (ChIP-on-ChIP)  using  an  antibody  that  recognizes 
exclusively  sites  of  CARMl-dependent  arginine  methylation, 
including  histone  H3  dimethylated  on  arginine  17  (H3R17me2) 
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FIG.  3.  Clusters  associated  with  CARMl  activation  drive  the  response  under  E2  treatment.  (A)  Proportion  of  E2  upregulated  genes  compared 
to  nonregulated  genes  with  at  least  one  binding  site  from  a  specific  cluster  within  increasing  window  distances  from  their  TSS  in  MCF7  cells.  *, 
P  £  0.05;  **,  P  £  0.01;  ***,  P  <  0.001.  (B)  GPER  and  TESK2  expression  after  CARMl  silencing  in  MCF7  was  determined  by  reverse 
transcription-qPCR  and  revealed  the  requirement  for  CARMl  in  the  E2-mduced  repression  of  GPER  and  TESK2.  siLUC  was  used  as  a  control. 
*,  P  s  0.05;  **,  P  £  0.01;  ***,  P  £  0.001.  (C)  Enrichment  of  Ec3  cluster  sites  (blue  blocks)  near  GPER  and  TESK2  E2-downregulated  target 
genes  (red  blocks). 


and  the  CARMl-dependent  arginine  methylation  of  AIBl  (see 
Fig.  SI  and  S2  in  the  supplemental  material)  (9,  48,  67).  More 
than  4,088  and  4,461  high-confidence  sites  were  identified  be¬ 
fore  and  after  E2  stimulation,  respectively  (FDR,  6%)  (see  Fig. 


SIA  and  B  in  the  supplemental  material).  Interestingly, 
CARMl  activity  was  found  predominantly  (94.1%)  at  regions 
far  from  known  promoters  (see  Fig.  SIC  in  the  supplemental 
material).  The  Pol  II  cistrome  was  also  determined  in  the 
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FIG.  4.  ERa-positive  primary  breast  tumor  expression  profile  relates  to  clusters  associated  with  CARMl  activation.  Relationship  between 
cluster-associated  gene  list  (genes  with  a  binding  site  from  a  given  cluster  within  20  kb  of  their  TSS)  and  genes  overexpressed  in  ERa-positive 
primary  breast  tumors  (the  top  1,  5,  or  10%  overexpressed  genes  from  primary  breast  tumors  were  included  in  the  analysis).  Twenty  independently 
defined  ERa-positive  primary  breast  tumor  overexpressing  gene  signatures  (blue)  were  compared  using  an  Oneomine  Concepts  Map  to  the  five 
enhancer  clusters  (red)  derived  gene  lists.  Odds  ratios  (OR)  are  presented  when  clusters  are  significantly  associated  with  independent  primary 
breast  cancer  overexpression  gene  signatures  {P  <  6e“®). 


absence  of  E2  to  address  the  role  of  promoter-associated  fac¬ 
tors  in  this  system  (see  Fig.  S3  in  the  supplemental  material). 
As  anticipated,  of  the  7,420  high-confidence  sites  (FDR,  5%) 
recruiting  Pol  II,  55.4%  were  recruited  within  I  kb  upstream  of 
annotated  transcription  start  sites  (TSS)  (see  Fig.  S3A  to  C  in 
the  supplemental  material). 

To  establish  the  contribution  of  CARMl  activation  and  Pol 
II  recruitment  to  F2  signaling,  we  combined  our  newly  derived 
cistromes  with  previously  published  cistromes  for  FoxAl  in  the 
presence  or  absence  of  F2,  as  well  as  FRa  and  Pol  II  in 
F2-treated  MCF7  cells  (7,  40).  We  first  established  the  binding 
activity  as  determined  by  MAT  score  (29)  for  all  factors  across 
the  25,416  high-confidence  regions  recruiting  at  least  one  of 
these  factors  in  MCF7  cells.  The  use  of  k-means  clustering 
revealed  five  enhancer-rich  clusters  and  two  promoter-rich 
clusters  (Fig.  lA  to  C).  Interestingly,  each  cluster  consisted  of 
sites  demonstrating  high  sequence  conservation  across  verte¬ 
brate  species  (see  Fig.  S4  in  the  supplemental  material).  FRa 
was  most  significantly  recruited  after  F2  stimulation  to  clusters 
Fcl  (23%  of  the  5782  FRa  high-confidence  sites)  and  Fc3 
(47%  of  the  5782  FRa  high-confidence  sites),  with  <2.6%  of 
the  high-confidence  sites  found  at  promoters  (Fig.  lA  and  C). 
The  previously  reported  sites  of  FoxAl  recruitment  favoring 
FRa  binding  were  found  predominantly  in  cluster  Fcl  but  not 
Fc3  (Fig.  lA  and  C).  Correspondingly,  both  cluster  Fcl  and 
Fc3  were  highly  enriched  for  the  FRF  half-site  motif,  while  the 
Forkhead  motif  was  only  enriched  in  cluster  Fcl  (Fig.  ID).  In 
addition,  both  clusters  demonstrated  F2-induced  CARMl  ac¬ 
tivity  as  measured  by  the  MAT  score  (Fig.  lA  and  C  and  see 
Fig.  S5  in  the  supplemental  material).  Cluster  Fc2  consisted  of 
sites  found  at  <2.1%  of  promoters  where  FoxAl  was  strongly 
recruited  but  where  FRa  had  low  binding  activity  (Fig.  lA  to 
C  and  see  Fig.  S5  in  the  supplemental  material).  Accordingly, 
the  Forkhead  motif  was  enriched  in  this  cluster,  while  the  FRF 
half-site  motif  was  not  significantly  enriched  (Fig.  ID).  In  ad¬ 
dition,  CARMl  activity  was  not  induced  on  sites  from  this 
cluster  following  F2  stimulation  (Fig.  lA  and  C  and  see  Fig.  S5 
in  the  supplemental  material).  Finally,  sites  from  the  enhancer- 


rich  clusters  Fc4  and  Fc5,  with  <6.1%  of  sites  at  promoters, 
did  not  demonstrate  strong  FRa  recruitment.  However,  sites 
from  cluster  Fc4  but  not  Fc5  associated  with  FoxAl  binding. 
In  addition,  ligand-independent  CARMl  activity  was  associ¬ 
ated  with  cluster  Fc5  independently  of  F2  stimulation  (Fig.  lA 
and  C  and  see  Fig.  S5  in  the  supplemental  material).  Globally, 
these  data  reveal  that  various  classes  of  regulatory  elements 
are  established  under  F2  stimulation,  and  those  associated 
with  FRa  and  FoxAl  recruitment,  as  well  as  CARMl  activity, 
are  found  predominantly  in  enhancer  regions  across  the  ge¬ 
nome. 

E2-induced  CARMl  activity  associates  with  coactivator  re¬ 
cruitment,  histone  modifications,  and  chromatin  opening.  A 

common  feature  of  sites  from  clusters  Fcl  and  Ec3  predomi¬ 
nantly  involved  in  the  E2-mediated  regulation  of  gene  expres¬ 
sion  is  their  association  with  the  induction  of  CARMl  activity 
after  E2  treatment  (Fig.  lA  and  C).  In  order  to  better  char¬ 
acterize  the  active  state  of  these  enhancer  regions,  we  investi¬ 
gated  the  level  of  coactivator  recruitment  and  histone  modifi¬ 
cations  after  E2  treatment.  Sites  recruiting  ERa  and  associated 
with  CARMl  activity  significantly  recruited  other  coactivators, 
such  as  p300  and  SRCl,  under  E2  stimulation  (Fig.  2A  and  see 
Fig.  S6  in  the  supplemental  material).  Similarly,  histone  mod¬ 
ifications,  such  as  acetylation  of  lysine  18  or  27  on  histone  H3 
(H3K18ac,  H3K27ac),  as  well  as  on  lysine  12  of  histone  H4 
(H4K12ac),  were  significantly  induced  by  E2  on  these  same 
sites  (Fig.  2B).  ERa  binding  sites  not  associated  with  the  in¬ 
duction  of  CARMl  activity  did  not  demonstrate  any  significant 
induction  of  coactivator  recruitment  or  histone  modification 
under  E2  treatment  (Fig.  2A  and  B).  It  is  noteworthy  that  ERa 
binding  sites  undergoing  coactivator  recruitment  and  histone 
modifications  after  E2  treatment  also  associated  with  E2-in- 
duced  chromatin  opening  measured  by  histone  H3  density  or 
extractability  by  FAIRE  (21)  (Fig.  2C).  Considering  that  30% 
of  ERa  binding  sites  are  not  associated  with  clusters  Eel  or 
Ec3  typified  by  E2-inducted  CARMl  activity,  our  results  reveal 
that  the  specific  transcriptional  response  to  E2  is  in  part  de- 
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FIG.  5.  Cell-type  specific  coactivation  of  ERa  binding  sites  associates  with  the  transcriptional  response.  (A)  Relative  expression  of  PDK4  and 
FasL  genes  after  E2  treatment  for  3  h  in  MCF7  breast  cancer  and  U20S/ERa  cells.  (B)  Relative  enrichment  of  ERa  and  CARMl  activity 
established  by  ChIP  in  both  MCF7  and  U20S/ERa  cells  after  E2  treatment  at  the  PDK4  and  FasL  enhancers. 


pendent  on  the  selective  activation  of  a  fraction  of  sites  re¬ 
cruiting  ERa. 

CARMl  activation  on  ERa  binding  sites  drives  the  tran¬ 
scriptional  response  to  E2.  In  order  to  address  the  role  of  the 
various  enhancer-rich  clusters  in  gene  regulation,  we  estab¬ 
lished  the  proportion  of  genes  regulated  after  3  h  of  E2  treat¬ 
ment  versus  nonregulated  genes  with  at  least  one  binding  site 
from  a  particular  cluster  within  increasing  window  distances  in 
kilobases  from  the  TSS.  This  revealed  a  significant  enrichment 
of  E2  upregulated  genes  over  nonregulated  genes  with  regard 
to  sites  from  clusters  Eel,  Ec2,  and  Ec3  from  various  window 
distances  from  the  TSS,  as  far  as  160  to  320  kb  for  both  Eel 
and  Ec3  (Fig.  3A).  Hence,  our  results  suggest  that  the  subset  of 
the  ERa  cistrome  subject  to  CARMl  activation  upon  E2  treat¬ 
ment,  whether  strongly  or  weakly  associated  with  FoxAl  bind¬ 
ing,  is  responsible  for  E2-mediated  gene  induction.  Thus,  the 
previously  suggested  role  for  CARMl  in  mediating  the  E2 
response  (19,  67)  is  due  to  its  activity  at  enhancer  regions 
defined  by  a  specific  subset  of  the  ERa  cistrome.  Interestingly, 


genes  downregulated  after  E2  stimulation  were  significantly 
enriched  over  nonregulated  genes  near  sites  primarily  from 
cluster  Ec3  that  could  be  as  far  away  as  160  to  320  kb  (Fig.  3A). 
Accordingly,  silencing  CARMl  (see  Fig.  S2A  in  the  supple¬ 
mental  material)  significantly  prevented  the  E2-mediated  re¬ 
pression  of  GPER  and  TESK2  (Fig.  3B  and  C).  Hence,  this 
finding  suggests  a  predominant  role  for  ERa  sites  associated 
weakly  or  not  at  all  with  FoxAl  and  undergoing  ligand-depen¬ 
dent  CARMl  activity  in  E2-mediated  gene  downregulation. 

In  order  to  address  the  physiological  relevance  of  the  differ¬ 
ent  clusters,  we  compared  the  list  of  genes  with  a  binding  site 
from  a  particular  cluster  within  20  kb  of  their  TSS  to  the  top 
genes  coexpressed  with  ERa  in  primary  breast  tumors  from  20 
independent  studies  (4,  10,  13,  20,  23,  26,  27,  43,  44,  50,  51,  54, 
55,  60-62,  64-66,  68,  70).  Remarkably,  genes  coexpressed  with 
ERa  defined  in  19  out  of  the  20  independent  studies  were 
highly  associated  (odds  ratio  ^  3)  with  sites  from  Eel  within  20 
kb  of  their  TSS  (Fig.  4).  Less  significant  association  (odds  ratio 
between  2  and  3)  between  ERa  coexpressed  genes  and  sites 
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FIG.  6.  Pol  II  occupied  promoter  of  E2  target  genes  prior  to  stimulation.  (A)  Cluster  analysis  performed  as  described  for  Fig.  lA  across  the 
25,903  promoter  regions  associated  with  the  RefSeq  genes.  (B)  Proportions  of  all,  E2-upregulated,  and  downregulated  genes  with  a  promoter 
typical  of  clusters  PcP™™,  Pc2'’™“,  or  PcNulF™™.  (C)  Enrichment  of  E2-downregulated  versus  nonregulated  genes  with  at  least  one  promoter  of 
the  PclP™™,  Pc2P'°™,  or  PcNuIIp™™  cluster  within  increasing  window  distances  from  the  genes’  TSS.  (D)  Specific  examples  of  E2  downregulated 
genes  surrounded  by  gene  with  Pol  II  deprived  promoters.  Sites  from  cluster  Pcl^™™  (orange),  Pc2'”’°™  (pink),  or  PcNulF™”  (dark  blue)  are 
presented  with  respect  to  E2-downregulated  genes  (red  block).  (E)  Relative  chromatin  accessibilities  of  promoters  from  cluster  PcIp™™,  Pc2P''°'“, 
or  PcNulF™”  measured  by  FAIRE-ChIP.  *,  P  <  0.05;  **,  P  £  0.01;  ***,  P  £  0.001. 


from  Ec2  and  Ec3  was  also  detected  in  2  out  of  the  20  inde¬ 
pendent  expression  profiles  from  primary  breast  tumors  (Fig. 
4).  Hence,  these  results  further  support  the  predominant  reg¬ 
ulatory  role  of  sites  from  cluster  Eel  and  less  significantly  from 
clusters  Ec2  and  Ec3  in  the  establishment  of  the  phenotype  of 
ERa-positive  breast  cancers. 


Further  evidence  for  the  association  between  sites  of  ERa 
recruitment  and  their  activation  to  mediate  transcriptional 
program  originates  from  the  comparison  of  the  MCF7  breast 
cancer  and  U20S  osteosarcoma  cell  lines.  Indeed,  E2  treat¬ 
ment  in  both  cell  lines  allows  for  the  recruitment  of  ERa  to  a 
number  of  common  sites  (34).  Interestingly,  the  transcriptional 
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FIG.  7.  Model  for  the  selection  of  functional  and  active  enhancer  sites  in  response  to  estrogen  stimulation  in  breast  cancer  cells.  A  schematic 
representation  of  the  transcriptional  response  to  E2  stimulation  in  breast  cancer  cells  is  shown.  The  functional  association  between  ERa  recruiting 
sites  undergoing  coactivator  (CoA)  recruitment/activation  and  histone  modifications  with  transcriptional  regulation  of  the  gene  harboring  Pol  II 
at  their  promoters  both  prior  to  and  after  E2  stimulation  is  depicted.  Sites  of  ERa  recruitment  not  associated  with  these  secondary  events  do  not 
significantly  impact  E2-induced  regulation  of  gene  expression. 


program  is  cell  line  specific  (34).  For  instance,  PDK4  and  FasL 
are  two  U20S-specific  E2-induced  genes  (Fig.  5A).  Although 
ERa  gets  recruited  to  the  PDK4  and  FasL  enhancers  in  both 
cell  lines,  they  are  coactivated  only  in  U20S  cells  (Fig.  5B). 
These  results  suggest  that  ERa  recruitment  associates  with 
coactivation  in  order  to  mediate  gene  expression. 

Pol  II  occupies  the  promoter  of  E2  regulated  genes  prior  to 
stimulation.  To  address  how  different  components  of  the  tran¬ 
scriptional  response  to  E2  signaling  impact  promoter  activity  in 
breast  cancer  cells,  we  performed  A;-means  clustering  on  the 
25,903  RefSeq  gene  promoters  as  described  for  Fig.  lA.  Three 
distinct  clusters  could  be  defined  (Fig.  6A).  The  first  promoter 
cluster  (Pclf’'°™)  consisting  of  4,846  sites  revealed  strong  Pol  II 
recruitment  both  prior  to  and  after  E2  stimulation  and  no 
significant  recruitment  of  ERa,  FoxAl  nor  evidence  of 
CARMl  activity  (Fig.  6A).  Similarly,  the  7,316  promoters 
found  in  the  second  cluster  (Pc2P''°'”)  were  specifically  en¬ 
riched  for  Pol  II  both  prior  to  and  after  E2  stimulation,  albeit 
at  lower  levels  than  on  sites  from  PcIP’^°™  (Fig.  6A).  Finally, 
13,741  promoters  (PcNu11P'^°™)  in  MCF7  cells  were  not  signif¬ 
icantly  associated  with  the  recruitment  of  Pol  II,  ERa,  FoxAl, 
or  CARMl  activity  (Fig.  6A).  Interestingly,  more  than  47%  of 
promoters  were  associated  with  sites  from  either  Pc1P'^°™  or 
Pc2prom;  PqJ  jj  Qccupied  promoters  prior  to  E2 

stimulation  (Fig.  6A  and  B  and  see  Fig.  S7A  in  the  supple¬ 
mental  material).  Strikingly,  more  than  85%  of  the  E2  upregu- 
lated  and  74%  of  E2  downregulated  genes  had  a  promoter 
typical  of  either  cluster  Pc1P’^°”  or  Pc2P'^°”  (Fig.  6B  and  see 
Fig.  S7A  in  the  supplemental  material).  Furthermore,  down- 


regulated  genes  were  typically  found  in  the  regions  of  Pol  II 
unoccupied  promoters  (Fig.  6C  and  D).  In  agreement  with  this, 
the  promoters  of  genes  surrounding  downregulated  genes  were 
found  in  condensed  chromatin  measured  by  FAIRE  (Fig.  6E) 
(17).  This  suggests  that  stimulus-dependent  gene  regulation  is 
predominantly  dependent  on  receptive  promoters  as  defined 
by  the  presence  of  Pol  II  prior  to  stimulation  and  on  the  state 
of  promoter  occupancy  in  neighboring  genes. 

DISCUSSION 

The  selective  utilization  of  enhancer  regions  and  promoters 
is  central  to  the  establishment  of  lineage-specific  transcrip¬ 
tional  programs  and  stimuli  specific  responses.  Exploiting  the 
E2  signaling  pathway,  we  have  combined  the  cistromes  from 
different  components  of  a  transcriptional  response,  namely, 
the  pioneer  factor  FoxAl,  the  transcription  factor  ERa,  a 
marker  of  the  activity  of  the  coactivator  CARMl,  and  Pol  II. 
Our  results  reveal  that  FoxAl-dependent  and  -independent 
ERa  sites  coactivated  upon  E2  treatment  are  predominantly 
driving  the  response  to  E2  in  breast  cancer  cells  (Fig.  7). 
Hence,  the  specific  transcriptional  program  associated  with  E2 
stimulation  is  not  only  dependent  on  the  restricted  genomic 
recruitment  of  ERa  but  also  on  the  activation  of  a  selected 
number  of  binding  sites  associated  with  coactivator  recruit¬ 
ment  and  histone  modifications.  This  is  in  agreement  with  the 
central  role  of  coactivators  in  the  response  to  E2  in  both  cell 
lines  and  mouse  models  (19,  46,  59,  67).  Interestingly,  our 
results  also  reveal  an  association  between  ERa  binding  sites 
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displaying  E2-induced  CARMl  activity  and  gene  repression. 
As  we  previously  suggested,  squelching  and/or  displacement  of 
transcriptional  units  with  greater  regulatory  capacities  could 
account  for  this  association  (7).  In  addition,  we  demonstrate  on 
a  genome-wide  scale  that  enhancers  located  as  far  as  160  to  320 
kb  from  the  TSS  of  the  E2  target  genes  mediate  the  transcrip¬ 
tional  response.  This  is  in  accordance  with  previous  studies 
revealing  the  intrachromosomal  interactions  required  for  op¬ 
timal  transcriptional  response  upon  E2  stimulation  in  MCF7 
cells  (6,  12).  Considering  the  commonality  of  such  long-range 
interactions  between  promoters  and  enhancers  (38,  49,  63), 
defining  how  these  are  established  on  a  genome-wide  scale  is 
of  fundamental  importance. 

Furthermore,  we  show  that  Pol  II  promoter  occupancy  is 
typical  of  E2  responsive  genes  in  MCF7  breast  cancer  cells  that 
are  both  up-  and  downregulated.  This  is  reminiscent  of  studies 
in  Drosophila  revealing  the  contribution  of  stalled  or  poised 
Pol  II  at  the  promoter  of  genes  involved  in  the  response  to 
stimuli  and  developmental  signals  (47,  69).  It  also  parallels 
previous  work  revealing  Pol  II  at  the  promoter  of  unexpressed 
genes  (2,  24,  31-33,  53).  It  is  consistent  with  the  concept  of  Pol 
II  foci  in  the  nucleus  known  as  transcription  factories  that 
remain  intact  in  the  absence  of  transcription  (45).  In  fact, 
postrecruitment  regulation  of  Pol  II  was  recently  revealed  to 
be  central  for  the  rapid  signaling  response  to  estrogen  (33). 
Interestingly,  we  identified  a  difference  between  the  genomic 
environments  of  E2  up-  versus  downregulated  genes.  Indeed, 
although  Pol  II  typically  occupies  the  promoter  of  E2  down¬ 
regulated  genes,  the  promoters  of  surrounding  genes  tend  to 
be  deprived  of  Pol  II.  Hence,  the  transcriptional  response  is 
dependent  on  the  presence  of  a  receptive  promoter  typified  by 
Pol  II  occupancy  prior  to  stimulation. 

Globally,  our  results  reveal  that  the  specificity  of  the  tran¬ 
scriptional  response  to  E2  stimulation  is  dependent  on  the 
interplay  between  receptive  promoters  occupied  by  Pol  II  prior 
to  stimulation  and  subclasses  of  ERa  enhancers  associated 
with  E2-induced  coactivator  activity.  Considering  the  unique 
expression  profiles  associated  with  ERa  activation  under  dis¬ 
tinct  stimuli  (11,  14),  it  remains  to  be  established  whether 
distinct  subclasses  of  the  ERa  cistrome  will  be  involved  in 
these  responses  as  well. 
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Abstract 

Alterations  in  transcription  programs  are  a  fundamentai  feature  of  cancer.  Nuclear  receptors,  such 
as  the  estrogen  receptor  alpha  (ERa)  and  androgen  receptors  (ARs),  are  central  in  this  process  as 
they  can  directly  impact  gene  expression  through  interaction  with  the  chromatin  and  subsequent 
association  with  coregulators  and  the  transcriptional  machinery.  Unbiased  genome-wide 
investigations  have  demonstrated  the  predominant  recruitment  of  both  ERa  and  AR  to  distant 
(non-promoter)-regulatory  elements.  Furthermore,  these  studies  revealed  a  clear  relationship 
between  sites  of  transcription  factor  recruitment  and  gene  regulation.  Indeed,  expression  profiles 
from  AR-positive  primary  prostate  tumors  and  cell  lines  directly  relate  to  the  AR  cistrome  in  prostate 
cancer  cells,  while  the  ERa  cistrome  in  breast  cancer  cells  relates  to  expression  profiles  from 
ERa-positive  primary  breast  tumors.  Additionally,  cell-type-specific  ERa  cistromes  are  linked  to 
lineage-specific  estrogen-induced  expression  profiles  in  different  cell  types,  for  example 
osteosarcoma  and  breast  cancer  cells.  The  pioneer  factor  forkhead  box  A1  (FoxA1/HNF3a) 
plays  a  central  role  in  AR  and  ERa  signaling.  It  is  recruited  in  a  lineage-specific  manner  translating 
the  epigenetic  signature  consisting  of  mono-  and  dimethylated  histone  H3  on  lysine 
4  (FI3K4me1/me2)  into  functional  regulatory  elements.  Hence,  through  the  interplay  between  the 
pioneer  factor,  namely  FoxAl ,  and  epigenetic  events,  the  transcriptional  potential  of  a  given  cell 
lineage  is  predefined.  Since  this  directly  impacts  signaling  through  nuclear  receptors,  these 
discoveries  should  significantly  impact  the  development  of  novel  therapeutic  strategies  directed 
against  multiple  types  of  cancer. 
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Characteristics  of  nuclear  receptor 
cistromes 

Gene  expression  profiling  of  cancer  has  led  to  important 
new  insights  both  in  terms  of  classification  and  outcome. 
This  is  exemplified  in  breast  cancer  through  studies 
where  different  subtypes  of  primary  breast  tumors  have 
been  identified  (Sorlie  et  al.  2001,  2003),  which  also 
correspond  to  different  disease/treatment  outcomes 
(Sorlie  et  al.  2003).  Similarly,  prostate  cancer  develop¬ 
ment  associates  with  transcriptional  programs  distinct 
from  normal  tissues  (Welsh  et  al.  2001,  Lapointe  et  al. 
2004,  Yu  et  al.  2004).  Therefore,  understanding  the 
mechanisms  that  lead  to  these  altered  expression  profiles 
is  fundamental  to  the  development  of  effective  thera¬ 
peutic  intervention  against  cancers. 

Nuclear  receptors  are  central  to  the  development  of 
both  breast  and  prostate  cancer.  The  estrogen  receptor 


alpha  (ERa)  is  a  fundamental  feature  of  more  than  two- 
thirds  of  breast  cancers  (Sorlie  et  al.  2001,  2003),  while 
prostate  cancer  is  highly  dependent  on  the  actions  of 
the  androgen  receptor  (AR;  Heinlein  &  Chang  2004). 
Both  ERa  and  AR  are  ligand-dependent  transcription 
factors  recruited  directly  to  the  chromatin  through  the 
estrogen-responsive  elements  (EREs)  and  androgen- 
responsive  elements  (AREs)  respectively.  They  are 
also  indirectly  recruited  to  other  genomic  regions 
through  a  tethering  mechanism  involving  other 
transcription  factors  such  as  AP-1  and  Spl  (Sanchez 
et  al.  2002).  Through  their  interplay  with  coregulators, 
ERa  and  AR  regulate  the  expression  of  genes  central  to 
breast  and  prostate  cancer  development,  including 
CCNDl,  E2F1,  Myc  as  well  as  TMPRSS2,  and  PSA 
respectively  (Prall  et  al.  1998,  Balk  et  al.  2003, 
Demichelis  &  Rubin  2007,  Slender  et  al.  2007,  Erietze 
et  al.  2008,  Setlur  et  al.  2008). 
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Recent  technological  advancements  have  allowed 
the  mapping  of  the  regulatory  regions  recruiting  either 
directly  or  through  tethering  mechanisms  these 
receptors  on  a  genome-wide  scale  defining  their 
cistromes  (Lupien  et  al.  2008).  Indeed,  upon  stimu¬ 
lation  ERa  and  AR  are  recruited  to  1000  of  sites  in 
hreast  and  prostate  cancer  cell  lines  respectively 
(Carroll  et  al.  2005,  2006,  Lin  et  al.  2007,  Wang 
et  al.  2007,  Hua  et  al.  2008,  Hurtado  et  al.  2008,  Liu 
et  al.  2008,  Lupien  et  al.  2008).  These  unbiased  genome¬ 
wide  studies  have  revealed  the  preferential  recruitment 
of  both  ERa  and  AR  with  non-promoter-regulatory 
elements.  This  pattern  of  promoter-distant  recruitment 
is  also  typical  of  other  transcription  factors  in  various 
systems  such  as  forkhead  box  Al  (EoxAl),  RelA  (p65), 
NRSE,  SRA,  GABP,  and  many  more  (Carroll  et  al. 

2006,  Johnson  et  al.  2007,  Lim  et  al.  2007,  Lupien 
et  al.  2008,  Valouev  et  al.  2008).  This  contrasts  with 
the  distribution  of  other  transcription  factors  such  as 
E2E  family  members  that  are  primarily  recruited  to 
promoters  (Bieda  et  al.  2006,  Xu  et  al.  2007).  This 
highlights  the  complexity  of  the  transcriptional 
response  in  higher  order  eukaryotes  as  regulatory 
elements  can  be  found  hundreds  of  kilobases  (kb)  away 
from  their  target  genes,  but  still  be  brought  into 
promoter  close  proximity  through  chromosome  loop¬ 
ing  (Dekker  2008).  In  fact,  this  process  has  been 
previously  reported  on  a  limited  subset  of  ERa-  and 
AR-binding  sites  for  regions  up  to  140  kb  away  from 
the  target  gene  (Carroll  et  al.  2005,  Deschenes  et  al. 

2007,  Wang  et  al.  2007).  Hence,  a  key  milestone  in 
transcriptional  biology  will  consist  of  establishing  the 
sum  of  all  chromosome  loops  guiding  transcriptional 
responses  on  a  genome-wide  scale. 

From  nuclear  receptor  cistrome  to 
transcription 

The  comparison  of  nuclear  receptor  cistromes  and 
hormone-regulated  expression  profiles  reveals  a  clear 
relationship  between  the  two.  Indeed,  genes  over¬ 
expressed  in  ERa-positive  primary  breast  tumors  as 
well  as  estrogen  target  genes  in  breast  cancer  cell  lines 
are  preferentially  surrounded  by  estrogen-induced 
ERa-binding  sites  found  in  this  same  system  (Carroll 
et  al.  2006,  Lupien  et  al.  2008,  Eig.  1).  Similarly, 
MCE7  cells  overexpressing  AKT  induce  a  unique  ERa 
cistrome  that  relates  to  the  AKT-dependent  expression 
profile  (Bhat-Nakshatri  et  al.  2008,  Fig.  1).  This  is 
also  observed  in  osteosarcoma  where  the  estrogen- 
induced  ERa  cistrome,  distinct  from  the  ERa 
cistrome  in  breast  cancer  cells,  directly  relates  to  the 
estrogen-induced  expression  profile  in  osteosarcoma 


(ICrum  et  al.  2008,  Fig.  1).  Furthermore,  ERa-binding 
sites  cluster  around  these  regulated  genes  (Krum  et  al. 
2008).  Similarly,  the  AR  cistrome  in  androgen- 
dependent  prostate  cancer  cells  relates  to  the  andro¬ 
gen-induced  transcriptional  program  in  these  same 
cells  as  well  as  to  the  expression  profile  from  primary 
prostate  tumors  (Wang  et  al.  2007,  2009,  Lupien  et  al. 
2008,  Fig.  2).  Furthermore,  as  prostate  cancer  cells 
become  castration  resistant  following  androgen-depri¬ 
vation  therapy,  they  acquire  an  altered  expression 
program  accompanied  by  a  related  novel  AR  cistrome 
(Wang  et  al.  2009).  Hence,  the  capacity  to  establish 
specific  cistromes  under  distinct  activation  and  in 
different  lineages  is  central  to  the  implementation  of 
transcriptional  programs  that  define  the  nature  of 
cellular  identity. 

Coregulators  central  to  nuclear  receptor 
cistromes 

Through  sequence  analysis  of  regulatory  regions 
recruiting  either  ERa  or  AR,  conserved  networks  of 
regulatory  factors  have  been  defined  (Carroll  et  al. 
2005,  2006,  Laganiere  et  al.  2005,  Green  &  Carroll 
2007,  Wang  et  al.  2007,  Hurtado  et  al.  2008). 
Noteworthy,  the  GATA,  OCT,  PAX,  NKX,  and  LEE 
motifs  are  significantly  enriched  near  the  center  of 
ERa-  and/or  AR-binding  sites.  GATA3  recognizing 
the  GATA  motif  was  revealed  to  be  part  of  a  positive 
cross-regulatory  loop  with  ERa  in  breast  cancer  cells 
required  for  the  estrogen-mediated  cell  proliferation 
(Eeckhoute  et  al.  2007).  In  prostate  cancer  cells, 
GATA2  also  recognizing  the  GATA  motif  was  found 
to  interact  with  AR  and  potentiates  its  regulation  of 
target  genes  (Perez-Stable  et  al.  2000,  Wang  et  al. 
2007).  Similarly,  Oct-1  recognizing  the  OCT  motif  was 
shown  to  physically  interact  with  AR  and  its 
expression  is  required  for  AR-mediated  transcriptional 
regulation  in  prostate  cancer  cells  (Gonzalez  &  Robins 
2001,  Wang  et  al.  2007).  Furthermore,  Oct-1  is 
co-recruited  with  ERa  in  breast  cancer  cells  regulating 
key  target  genes,  namely  CCNDl  (Cicatiello  et  al. 
2004,  Carroll  et  al.  2006).  More  recently,  PAX2 
co-recruitment  with  ERa  to  the  £'RBB2-regulatory 
element  has  revealed  its  central  role  as  a  transcriptional 
repressor  required  for  inhibition  of  ERBB2  expression 
in  breast  cancer  cells  (Hurtado  et  al.  2008).  Accor¬ 
dingly,  loss  of  PAX2  recruitment  allowed  for  ERBB2 
expression  in  the  presence  of  the  anti-estrogen 
tamoxifen  conferring  anti-estrogen-resistant-like  prop¬ 
erties  to  normally  anti-estrogen-sensitive  breast  cancer 
cells  (Hurtado  et  al.  2008).  Other  factors  such  as  LEF-1 
and  Nkx3-1  whose  DNA  recognition  motif  is  enriched 
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Figure  1  Lineage-  and  stimuli-specific  transcriptional  programs  are  dependent  on  differential  recruitment  of  ERa.  Thousands  of 
putative  ERa-binding  sites  are  found  across  the  human  genome.  This  includes  over  60  000  estrogen-responsive  elements  (EREs) 
and  a  number  of  regions  recruiting  ERa  through  a  tethering  mechanism.  However,  lineage-specific  ERa  recruitment,  as  reported 
between  breast  and  osteosarcoma  cancer  cell  lines,  is  central  to  the  unique  transcriptional  program  generated  in  each  cell  type 
following  estrogen  (E2)  treatment.  Similarly,  the  transcriptional  program  activated  through  the  PI3K/AKT  pathway  in  MCF7  cells 
expressing  a  constitutively  active  AKT  (CA-AKT)  is  dependent  on  a  unique  ERa  recruitment  pattern. 


in  ERa-binding  sites  in  breast  cancer  cells  behave  in  a 
distinct  manner.  Indeed,  instead  of  being  co-recruited 
with  ERa  following  estrogen  stimulation,  these 
transcription  factors  are  bound  at  the  hasal  state 
and  block  ERa  recruitment  abrogating  estrogen 
growth-promoting  properties  (Holmes  et  al.  2008). 
Since  LEF-1  and  Nkx3-1  can  associate  with  the  histone 
deacetylase  HDACl,  increased  chromatin  conden¬ 
sation  is  thought  to  be  fundamental  to  block  ERa 
recruitment  (Holmes  et  al.  2008).  Therefore,  by 
dehning  the  cistromes  of  ERa  and  AR,  the  intricate 
interplay  between  transcription  factors  and  their 
network  of  coregulatory  proteins  taking  place  at  the 
chromatin  is  gradually  being  revealed. 

Pioneer  factors  as  mediator  of  lineage- 
specific  transcriptional  programs 

The  FKH  motif  is  an  additional  motif  highly  enriched 
in  both  ERa  and  AR  cistromes  (Carroll  et  al.  2005, 
2006,  Laganiere  et  al.  2005,  Wang  et  al.  2007).  The 
forkhead  family  member  FoxAl  (HNF3a)  is  a  key 
partner  for  ERa  and  AR  transcriptional  activity  in 


breast  and  prostate  cancer  respectively,  recognizing  the 
EKH  motif.  It  was  first  characterized  as  a  pioneer  factor 
in  liver  tissue  (Gualdi  et  al.  1996,  Cirillo  et  al.  1998, 
Bossard  &  Zaret  2000).  More  recently,  its  ATP- 
independent  chromatin-remodeling  activity,  dis¬ 
tinguishing  it  from  the  classical  SWI/SNF  complex, 
has  shown  to  be  central  for  ERa  recruitment  in  breast 
cancer  cells  (Carroll  et  al.  2005,  Laganiere  et  al.  2005, 
Eeckhoute  et  al.  2006),  while  it  was  found  to 
physically  interact  with  AR  in  prostate  cancer  cells 
(Gao  et  al.  2003,  Wang  et  al.  2007).  Present  on  the 
chromatin  at  the  basal  state,  FoxAl  is  found  at  more 
than  60%  of  ERa-  and  AR-binding  sites  driving  the 
transcriptional  response  in  breast  and  prostate  cancer 
cells  respectively  (Lupien  et  al.  2008).  In  fact,  through 
its  chromatin-remodeling  activity,  EoxAl  allows  for 
the  opening  of  specific  genomic  regions  in  the  absence 
of  hormone  (Carroll  et  al.  2005,  Laganiere  et  al.  2005, 
Eeckhoute  et  al.  2006).  Hence,  under  hormonal 
stimulation,  ERa  and  AR  are  recruited  to  FoxAl 
sites  harboring  permissive  sequences  such  as  EREs  and 
AREs  (Lupien  et  al.  2008,  Fig.  3).  In  accordance  with 
its  predominant  role  in  ERa  signaling,  EoxAl  is 
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Figure  2  Chromatin  architecture  reprogramming  in  castration-resistant  prostate  cancer  cells.  Similar  to  ERa  in  breast  cancer  cell 
lines,  AR  is  recruited  to  a  fraction  of  its  putative  binding  sites  in  androgen-dependent  prostate  cancer  cells.  In  castration-resistant 
prostate  cancer  cells,  the  genome-wide  AR  recruitment  pattern  is  altered.  This  is  dependent  on  the  reprogramming  of  the  chromatin 
architecture  typified  by  de  novo  methylation  of  lysine  4  on  histone  H3  {H3K4me). 


typically  highly  expressed  in  ERa-positive  primary 
hreast  tumors  and  is  an  important  marker  of  breast 
cancer  subtype  and  prognosis  (Habashy  et  al.  2008, 
Thorat  et  al.  2008).  FoxAl  is  also  highly  expressed  in 
prostate  cancer  where  it  is  believed  to  contribute  to  the 
establishment  of  specihc  gene  expression  programs 
(Mirosevich  et  al.  2006).  However,  the  comparison  of 
FoxAl  cistromes  between  breast  and  prostate  cancer 
cells  reveals  its  cell-type-specific  recruitment  (Lupien 
et  al.  2008).  Indeed,  less  than  40%  of  FoxAl-binding 
sites  are  shared  between  these  two  cell  lines  supporting 
the  notion  that  FoxAl  is  recruited  in  a  lineage-specific 
fashion.  Importantly,  because  FoxAl  guides  the 
recruitment  of  transcription  factors,  such  as  ERa  and 
AR,  lineage-specific  transcriptional  programs  depen¬ 
dent  on  these  transcription  factors  are  directly  affected 
by  FoxAl 's  cell-type-specific  recruitment  (Fig.  3). 

Epigenetic  signatures  define  lineage- 
specific  functional  regulatory  elements 

The  requirement  for  lineage-specific  recruitment  of  the 
pioneer  factor  FoxAl  highlights  the  importance  of 
understanding  how  such  differential  recruitment  takes 
place.  Recently,  specific  epigenetic  signatures  dis¬ 
tinguishing  non-promoter  from  promoter-regulatory 


elements  have  been  reported  (Santos-Rosa  et  al. 
2002,  Ng  et  al.  2003,  Schneider  et  al.  2004,  Schubeler 
et  al.  2004,  Bernstein  et  al.  2005,  Pokholok  et  al. 
2005,  Heintzman  et  al.  2007).  This  signature, 
characterized  by  different  methylation  states  (mono-, 
di-,  or  tri-)  of  lysine  4  on  histone  H3  (H3K4mel,  me2, 
or  me3),  was  also  found  associated  with  distinct 
chromatin  regions  permissive  for  transgene  expression 
(Yan  &  Boyd  2006).  Specifically,  H3K4mel  and 
H3K4me2  were  associated  with  non-promoter-regulat- 
ory  elements,  while  H3K4me3  was  found  at  promoter 
regions  (Heintzman  et  al.  2007).  Furthermore,  regions 
enriched  in  H3K4me2  or  me3  associate  with  DNase  1 
hypersensitivity,  a  marker  of  active-regulatory  regions 
(Xi  et  al.  2007).  Therefore,  this  supports  the  notion  that 
H3K4me2  and  H3K4me3  are  specific  for  functional 
non-promoter  and  promoter-regulatory  elements 
respectively. 

In  agreement,  the  cell-type-specific  cistromes  for  the 
pioneer  factor  FoxAl  in  breast  and  prostate  cancer 
cells  are  dependent  on  H3K4mel/me2  distribution 
(Lupien  et  al.  2008).  In  reality,  H3K4mel/me2  defines 
in  a  lineage-specific  manner  through  which  regulatory 
elements  are  able  to  recruit  FoxAl  (Fig.  3).  Indeed, 
removal  of  this  epigenetic  signature  through  over¬ 
expression  of  the  lysine  demethylase  KDMl/LSDl 
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Figure  3  Lineage-specific  distribution  of  histone  H3  methylation  on  lysine  4  guides  FoxAl  recruitment.  Differentiated  cells  are 
characterized  by  a  unique  distribution  of  epigenetic  marks.  FoxAl  will  specifically  be  recruited  to  genomic  regions  harboring  the 
forkhead  motif  (FKFI)  marked  by  H3K4me1/me2.  Through  its  chromatin-remodeling  activity,  neighboring  chromatin  will  be  further 
opened  and  accessible  to  other  transcription  factors  such  as  ERa  and  AR  in  breast  and  prostate  cancer  cells  respectively.  By 
contrast,  FKH  regions  lacking  H3K4me1/me2  will  typically  associate  with  H3K9me2  and  be  deprived  of  FoxAl  recruitment.  Hence, 
the  lineage-specific  distribution  of  H3K4me1/me2  and  H3K9me2  guides  FoxAl  binding,  which  in  turn  restricts  the  recruitment  of 
other  transcription  factors. 


prevents  FoxAl  recruitment  (Lupien  et  al.  2008,  Wang 
et  al.  2009).  This  signature  is  also  characteristic  of  the 
ERa  cistrome  not  overlapping  with  FoxAl  (M  L  and 
M  B  unpublished  data).  In  fact,  lineage-specific  ERa 
cistromes  correlate  with  the  H3K4mel/me2  distri¬ 
bution.  Indeed,  ERa-binding  sites  specihc  to  breast 
cancer  or  osteosarcoma  cells  relate  to  the  unique 
distribution  of  H3K4mel/me2  in  these  respective  cell 
lines,  regardless  of  FoxAl  status  (Krum  et  al.  2008). 
Similarly,  the  recruitment  of  AR  to  novel  sites  in 
castration-resistant  prostate  cancer  cells  is  dependent 
on  the  de  novo  H3K4  mono-  and  dimethylation 
(Wang  et  al.  2009).  Removal  of  H3K4  mono-  and 
dimethylation  through  KDMl  overexpression  in  this 
model  also  suppressed  FoxAl  and  AR  recruitment 
(Wang  et  al.  2009). 

Similar  to  the  role  of  FoxAl  as  a  pioneer  factor  that 
translates  the  H3K4mel/me2  signature  at  non-promoter- 
regulatory  elements,  specihc  chromatin-remodeling 


components  are  recruited  to  H3K4me3-marked 
promoters.  The  ATP-dependent  chromatin-remodeling 
enzyme  CHDl  and  the  ATPase  SNF2H  are  recruited  to 
H3K4-methylated  promoters  (Santos-Rosa  et  al.  2003, 
Flanagan  etal.  2005,  Sims  etal.  2005).  This  suggests  that 
epigenetic  marks  are  insufficient  for  transcription  factor 
recruitment.  Therefore,  it  appears  that  the  interplay 
between  epigenetic  marks  and  chromatin-remodeling 
factors  is  required  to  open  chromatin  in  specihc  genomic 
locations  to  guide  transcription  factor  recruitment  both  at 
promoter  and  non-promoter-regulatory  elements. 

Understanding  how  the  epigenetic  signatures,  such 
as  the  methylated-H3K4-based  signature,  are  estab¬ 
lished  in  the  course  of  normal  and  disease  development 
is  of  central  interest.  To  date,  up  to  ten  histone 
methyltransferases  specihc  to  H3K4  have  been 
characterized  and  a  growing  number  of  histone 
demethylases  are  being  identihed  (Christensen  et  al. 
2007,  Eissenberg  et  al.  2007,  Iwase  et  al.  2007, 
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Klose  et  al.  2007,  Lee  et  al.  2007,  Ruthenburg  et  al. 
2007,  Seward  et  al.  2007,  Shi  &  Whetstine  2007, 
Tahiliani  et  al.  2007,  Yamane  et  al.  2007).  Therefore, 
the  methylation  state  of  H3K4  appears  to  be  under  tight 
regulation.  Previous  reports  have  indicated  that  the 
distribution  of  H3K4me2  is  established  early  during 
differentiation  (Chambeyron  &  Bickmore  2004). 
Similarly,  work  on  promoter  regions  has  revealed  a 
progression  from  a  broad  to  a  more  restricted 
distribution  for  H3K4me3  in  the  course  of  differen¬ 
tiation  (Guenther  et  al.  2007,  Mikkelsen  et  al.  2007). 
Interestingly,  the  bivalent  state  of  histone  modifi¬ 
cations  found  at  promoters  relates  to  the  transitional 
state  of  poised  to  active  or  repressed  promoters. 
Indeed,  in  the  course  of  differentiation,  promoters 
harboring  both  H3K4me3  and  H3K27me3  are  enriched 
for  H3K4me3  when  the  associated  genes  are  expressed 
and  for  H3K27me3  when  the  genes  are  silenced 
(Azuara  et  al.  2006,  Bernstein  et  al.  2006,  Guenther 
et  al.  2007,  Mikkelsen  et  al.  2007).  Although  there  is 
no  clear  indication  of  a  similar  signature  at  non- 
promoter-regulatory  elements,  H3K9me2  has  recently 
been  found  enriched  at  inaccessible  promoter-distant 
regulatory  elements  (Lupien  et  al.  2008).  Furthermore, 
H3K4mel/me2  and  H3K9me2  occurred  together  on 
more  condensed  FoxAl-binding  sites  as  defined  by 
DNasel  hypersensitivity  and  Formaldehyde-Assisted 
Isolation  of  Regulatory  Elements,  an  additional  method 
that  delineates  DNA  accessibility  (Giresi  et  al.  2007, 
Eeckhoute  et  al.  2009).  Therefore,  additional  studies 
are  needed  to  reveal  fundamental  components  of  the 
role  played  by  chromatin  structure  in  transcriptional 
regulation  as  well  as  the  contribution  transcriptional 
events  may  have  on  epigenetic  components. 

Conclusion 

The  wealth  of  information  derived  from  cistrome- 
based  studies  is  already  revealing  core  concepts  of 
transcriptional  regulation.  Recently,  a  predictive  model 
based  on  the  ERa  and  EoxAl  cistromes  from  breast 
cancer  cells  as  well  as  the  cistrome  of  the  insulator 
protein  CCCTC-binding  factor  (CTCE)  has  predicted 
up  to  70%  of  estrogen-regulated  genes  (Chan  &  Song 
2008).  Pursuing  this  analysis  to  coregulatory  factors 
and  epigenetic  components  should  reveal  intricate 
mechanisms  fine  tuning  the  actions  of  ERa  and  AR. 
Considering  the  increasingly  recognized  role  of 
epigenetic  components  in  cancer  development  and 
progression,  a  better  understanding  of  their  function  in 
transcriptional  regulation  will  prove  fundamental  in  the 
elaboration  of  novel  therapeutic  strategies  to  breast, 
prostate,  and  other  cancers. 
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